Overview

Dataset statistics

Number of variables: 25
Number of observations: 400277
Missing cells: 2942109
Missing cells (%): 29.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 545.5 MiB
Average record size in memory: 1.4 KiB
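
The derived figures above can be cross-checked from the raw counts. The sketch below assumes that "Missing cells (%)" is missing cells divided by variables times observations, and that the average record size is total memory divided by the number of observations; the variable names are illustrative only.

```python
# Figures copied from the Dataset statistics block.
n_variables = 25
n_observations = 400277
missing_cells = 2942109
total_memory_mib = 545.5

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size = total memory / observations (1 MiB = 1024 KiB).
avg_record_kib = total_memory_mib * 1024 / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
print(f"average record size: {avg_record_kib:.1f} KiB")
```

Under these assumptions both values round to the figures reported above (29.4% and 1.4 KiB).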

Variable types

Categorical: 23
Numeric: 2

Warnings

object_description has a high cardinality: 602 distinct values High cardinality
text_2 has a high cardinality: 301 distinct values High cardinality
subfund_description has a high cardinality: 274 distinct values High cardinality
job_title_description has a high cardinality: 3516 distinct values High cardinality
text_4 has a high cardinality: 244 distinct values High cardinality
sub_object_description has a high cardinality: 182 distinct values High cardinality
location_description has a high cardinality: 354 distinct values High cardinality
function_description has a high cardinality: 687 distinct values High cardinality
facility_or_department has a high cardinality: 179 distinct values High cardinality
position_extra has a high cardinality: 580 distinct values High cardinality
program_description has a high cardinality: 421 distinct values High cardinality
fund_description has a high cardinality: 141 distinct values High cardinality
text_1 has a high cardinality: 1423 distinct values High cardinality
fte is highly correlated with total High correlation
total is highly correlated with fte High correlation
fte is highly correlated with total High correlation
total is highly correlated with fte High correlation
use is highly correlated with operating_status and 7 other fields High correlation
pre_k is highly correlated with text_3 and 2 other fields High correlation
operating_status is highly correlated with use and 7 other fields High correlation
text_3 is highly correlated with use and 7 other fields High correlation
position_type is highly correlated with use and 6 other fields High correlation
reporting is highly correlated with use and 7 other fields High correlation
object_type is highly correlated with use and 8 other fields High correlation
function is highly correlated with use and 7 other fields High correlation
student_type is highly correlated with use and 8 other fields High correlation
sharing is highly correlated with use and 7 other fields High correlation
use is highly correlated with operating_status and 4 other fields High correlation
pre_k is highly correlated with text_3 and 1 other field High correlation
operating_status is highly correlated with use and 5 other fields High correlation
text_3 is highly correlated with pre_k and 2 other fields High correlation
position_type is highly correlated with use and 3 other fields High correlation
reporting is highly correlated with use and 7 other fields High correlation
object_type is highly correlated with operating_status and 2 other fields High correlation
function is highly correlated with use and 5 other fields High correlation
student_type is highly correlated with pre_k and 1 other field High correlation
sharing is highly correlated with use and 4 other fields High correlation
object_description has 24784 (6.2%) missing values Missing
text_2 has 312060 (78.0%) missing values Missing
subfund_description has 93422 (23.3%) missing values Missing
job_title_description has 107534 (26.9%) missing values Missing
text_3 has 291125 (72.7%) missing values Missing
text_4 has 346531 (86.6%) missing values Missing
sub_object_description has 308674 (77.1%) missing values Missing
location_description has 238223 (59.5%) missing values Missing
fte has 274206 (68.5%) missing values Missing
function_description has 58082 (14.5%) missing values Missing
facility_or_department has 346391 (86.5%) missing values Missing
position_extra has 135513 (33.9%) missing values Missing
total has 4555 (1.1%) missing values Missing
program_description has 95617 (23.9%) missing values Missing
fund_description has 197400 (49.3%) missing values Missing
text_1 has 107992 (27.0%) missing values Missing
total is highly skewed (γ1 = 100.3197995) Skewed
fte has 31338 (7.8%) zeros Zeros
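
For readers unfamiliar with the γ1 statistic in the skewness warning, the sketch below computes the moment coefficient of skewness on toy data. This is an illustration only: the exact estimator used by the report (for example a bias-adjusted sample skewness) is an assumption not confirmed here, and the toy lists are not taken from the dataset.

```python
def skewness(values):
    """Moment coefficient of skewness: third central moment / variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical data: a symmetric sample and one dominated by a single large value.
symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]
long_tail = [1.0] * 99 + [1000.0]

print(skewness(symmetric))
print(skewness(long_tail))
```

A symmetric sample gives zero, while a few very large values push γ1 strongly positive, which is the pattern the warning for total points to.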

Reproduction

Analysis started: 2021-09-28 21:55:08.275695
Analysis finished: 2021-09-28 21:56:09.670033
Duration: 1 minute and 1.39 seconds
Software version: pandas-profiling v3.0.0
Download configuration: config.json
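
As a quick consistency check, the reported duration can be recomputed from the two timestamps with the standard library. This is only a sketch of the arithmetic, not how the report itself derives the value.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2021-09-28 21:55:08.275695")
finished = datetime.fromisoformat("2021-09-28 21:56:09.670033")

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} second(s)")
```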

Variables

function
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 37
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 29.4 MiB
Teacher Compensation: 86354
Substitute Compensation: 62215
NO_LABEL: 59579
Aides Compensation: 19858
Instructional Materials & Supplies: 19711
Other values (32): 152560

Length

Max length: 47
Median length: 20
Mean length: 19.92992852
Min length: 5

Characters and Unicode

Total characters: 7977492
Distinct characters: 45
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
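
The category breakdowns in this report (Uppercase Letter, Connector Punctuation, and so on) correspond to Unicode general categories. A minimal sketch using Python's standard `unicodedata` module is shown below; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, and whether the report computes its tables this way is an assumption.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

print(category_counts("Instructional Materials & Supplies"))
print(category_counts("NO_LABEL"))
```

Summing such per-value counts, weighted by how often each value occurs, would give column-level totals of the kind listed under "Most occurring categories".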

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Teacher Compensation
2nd row: NO_LABEL
3rd row: Teacher Compensation
4th row: Substitute Compensation
5th row: Substitute Compensation

Common Values

Value                               Count   Frequency (%)
Teacher Compensation                86354   21.6%
Substitute Compensation             62215   15.5%
NO_LABEL                            59579   14.9%
Aides Compensation                  19858   5.0%
Instructional Materials & Supplies  19711   4.9%
Facilities & Maintenance            19617   4.9%
Professional Development            19102   4.8%
Student Transportation              14371   3.6%
Food Services                       14203   3.5%
School Administration               13055   3.3%
Other values (27)                   72212   18.0%

Length

Histogram of lengths of the category
Value              Count    Frequency (%)
compensation       169196   19.4%
teacher            86354    9.9%
&                  82431    9.4%
substitute         62215    7.1%
no_label           59579    6.8%
development        26798    3.1%
services           23373    2.7%
aides              19858    2.3%
materials          19711    2.3%
instructional      19711    2.3%
Other values (58)  303609   34.8%

Most occurring characters

ValueCountFrequency (%)
e790630
 
9.9%
t655988
 
8.2%
n654098
 
8.2%
i602223
 
7.5%
o597031
 
7.5%
a522072
 
6.5%
472558
 
5.9%
s452959
 
5.7%
r297968
 
3.7%
p271889
 
3.4%
Other values (35)2660076
33.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter6206204
77.8%
Uppercase Letter1150038
 
14.4%
Space Separator472558
 
5.9%
Other Punctuation86953
 
1.1%
Connector Punctuation59579
 
0.7%
Dash Punctuation2160
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e790630
12.7%
t655988
10.6%
n654098
10.5%
i602223
9.7%
o597031
9.6%
a522072
8.4%
s452959
7.3%
r297968
 
4.8%
p271889
 
4.4%
m264533
 
4.3%
Other values (12)1096813
17.7%
Uppercase Letter
ValueCountFrequency (%)
C186837
16.2%
S158227
13.8%
L123797
10.8%
T117475
10.2%
A97494
8.5%
E83643
7.3%
B62197
 
5.4%
O62151
 
5.4%
N61382
 
5.3%
M48253
 
4.2%
Other values (8)148582
12.9%
Other Punctuation
ValueCountFrequency (%)
&82431
94.8%
,4522
 
5.2%
Space Separator
ValueCountFrequency (%)
472558
100.0%
Connector Punctuation
ValueCountFrequency (%)
_59579
100.0%
Dash Punctuation
ValueCountFrequency (%)
-2160
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin7356242
92.2%
Common621250
 
7.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e790630
 
10.7%
t655988
 
8.9%
n654098
 
8.9%
i602223
 
8.2%
o597031
 
8.1%
a522072
 
7.1%
s452959
 
6.2%
r297968
 
4.1%
p271889
 
3.7%
m264533
 
3.6%
Other values (30)2246851
30.5%
Common
ValueCountFrequency (%)
472558
76.1%
&82431
 
13.3%
_59579
 
9.6%
,4522
 
0.7%
-2160
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII7977492
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e790630
 
9.9%
t655988
 
8.2%
n654098
 
8.2%
i602223
 
7.5%
o597031
 
7.5%
a522072
 
6.5%
472558
 
5.9%
s452959
 
5.7%
r297968
 
3.7%
p271889
 
3.4%
Other values (35)2660076
33.3%

use
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 8
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 25.6 MiB
Instruction: 203608
NO_LABEL: 78712
O&M: 45868
ISPD: 26118
Pupil Services & Enrichment: 23779
Other values (3): 22192

Length

Max length: 27
Median length: 11
Mean length: 10.05295083
Min length: 3

Characters and Unicode

Total characters: 4023965
Distinct characters: 34
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Instruction
2nd row: NO_LABEL
3rd row: Instruction
4th row: Instruction
5th row: Instruction

Common Values

Value                        Count    Frequency (%)
Instruction                  203608   50.9%
NO_LABEL                     78712    19.7%
O&M                          45868    11.5%
ISPD                         26118    6.5%
Pupil Services & Enrichment  23779    5.9%
Leadership                   15715    3.9%
Business Services            6120     1.5%
Untracked Budget Set-Aside   357      0.1%

Length

Histogram of lengths of the category

Pie chart

Value             Count    Frequency (%)
instruction       203608   42.6%
no_label          78712    16.5%
o&m               45868    9.6%
services          29899    6.2%
ispd              26118    5.5%
pupil             23779    5.0%
&                 23779    5.0%
enrichment        23779    5.0%
leadership        15715    3.3%
business          6120     1.3%
Other values (3)  1071     0.2%
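
The word-frequency table above is consistent with splitting each value on whitespace and lowercasing the pieces. The sketch below rebuilds the counts from the Common Values table for use under that assumption; `word_counts` is an illustrative helper, not a function of the profiling library.

```python
from collections import Counter

# Value counts copied from the Common Values table for `use`.
USE_COUNTS = {
    "Instruction": 203608,
    "NO_LABEL": 78712,
    "O&M": 45868,
    "ISPD": 26118,
    "Pupil Services & Enrichment": 23779,
    "Leadership": 15715,
    "Business Services": 6120,
    "Untracked Budget Set-Aside": 357,
}

def word_counts(value_counts):
    """Weight each lowercased, whitespace-separated token by its value's row count."""
    words = Counter()
    for value, count in value_counts.items():
        for token in value.lower().split():
            words[token] += count
    return words

words = word_counts(USE_COUNTS)
total = sum(words.values())
for token, count in words.most_common(5):
    print(f"{token:12s} {count:7d} {100 * count / total:5.1f}%")
```

Under this tokenisation a standalone "&" counts as a word of its own, and "services" accumulates rows from both "Pupil Services & Enrichment" and "Business Services".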

Most occurring characters

ValueCountFrequency (%)
n461251
 
11.5%
t432066
 
10.7%
i303257
 
7.5%
r273358
 
6.8%
s267939
 
6.7%
c257643
 
6.4%
u233864
 
5.8%
I229726
 
5.7%
o203608
 
5.1%
L173139
 
4.3%
Other values (24)1188114
29.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2745558
68.2%
Uppercase Letter1051520
 
26.1%
Connector Punctuation78712
 
2.0%
Space Separator78171
 
1.9%
Other Punctuation69647
 
1.7%
Dash Punctuation357
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n461251
16.8%
t432066
15.7%
i303257
11.0%
r273358
10.0%
s267939
9.8%
c257643
9.4%
u233864
8.5%
o203608
7.4%
e122555
 
4.5%
p39494
 
1.4%
Other values (8)150523
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
I229726
21.8%
L173139
16.5%
O124580
11.8%
E102491
9.7%
B85189
 
8.1%
A79069
 
7.5%
N78712
 
7.5%
S56374
 
5.4%
P49897
 
4.7%
M45868
 
4.4%
Other values (2)26475
 
2.5%
Connector Punctuation
ValueCountFrequency (%)
_78712
100.0%
Other Punctuation
ValueCountFrequency (%)
&69647
100.0%
Space Separator
ValueCountFrequency (%)
78171
100.0%
Dash Punctuation
ValueCountFrequency (%)
-357
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin3797078
94.4%
Common226887
 
5.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
n461251
12.1%
t432066
11.4%
i303257
 
8.0%
r273358
 
7.2%
s267939
 
7.1%
c257643
 
6.8%
u233864
 
6.2%
I229726
 
6.1%
o203608
 
5.4%
L173139
 
4.6%
Other values (20)961227
25.3%
Common
ValueCountFrequency (%)
_78712
34.7%
78171
34.5%
&69647
30.7%
-357
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII4023965
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n461251
 
11.5%
t432066
 
10.7%
i303257
 
7.5%
r273358
 
6.8%
s267939
 
6.7%
c257643
 
6.4%
u233864
 
5.8%
I229726
 
5.7%
o203608
 
5.1%
L173139
 
4.3%
Other values (24)1188114
29.5%

sharing
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 27.5 MiB
School Reported: 254433
NO_LABEL: 59376
Shared Services: 42641
School on Central Budgets: 26849
Leadership & Management: 16978

Length

Max length: 25
Median length: 15
Mean length: 14.97172458
Min length: 8

Characters and Unicode

Total characters: 5992837
Distinct characters: 30
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: School Reported
2nd row: NO_LABEL
3rd row: School Reported
4th row: School Reported
5th row: School Reported

Common Values

Value                      Count    Frequency (%)
School Reported            254433   63.6%
NO_LABEL                   59376    14.8%
Shared Services            42641    10.7%
School on Central Budgets  26849    6.7%
Leadership & Management    16978    4.2%

Length

Histogram of lengths of the category

Pie chart

Value       Count    Frequency (%)
school      281282   34.6%
reported    254433   31.3%
no_label    59376    7.3%
shared      42641    5.3%
services    42641    5.3%
on          26849    3.3%
central     26849    3.3%
budgets     26849    3.3%
leadership  16978    2.1%
&           16978    2.1%

Most occurring characters

ValueCountFrequency (%)
o843846
14.1%
e758399
12.7%
411577
 
6.9%
r383542
 
6.4%
S366564
 
6.1%
h340901
 
5.7%
d340901
 
5.7%
t325109
 
5.4%
c323923
 
5.4%
l308131
 
5.1%
Other values (20)1589944
26.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4380623
73.1%
Uppercase Letter1124283
 
18.8%
Space Separator411577
 
6.9%
Connector Punctuation59376
 
1.0%
Other Punctuation16978
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o843846
19.3%
e758399
17.3%
r383542
8.8%
h340901
7.8%
d340901
7.8%
t325109
 
7.4%
c323923
 
7.4%
l308131
 
7.0%
p271411
 
6.2%
a120424
 
2.7%
Other values (7)364036
8.3%
Uppercase Letter
ValueCountFrequency (%)
S366564
32.6%
R254433
22.6%
L135730
 
12.1%
B86225
 
7.7%
N59376
 
5.3%
O59376
 
5.3%
A59376
 
5.3%
E59376
 
5.3%
C26849
 
2.4%
M16978
 
1.5%
Space Separator
ValueCountFrequency (%)
411577
100.0%
Connector Punctuation
ValueCountFrequency (%)
_59376
100.0%
Other Punctuation
ValueCountFrequency (%)
&16978
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin5504906
91.9%
Common487931
 
8.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
o843846
15.3%
e758399
13.8%
r383542
 
7.0%
S366564
 
6.7%
h340901
 
6.2%
d340901
 
6.2%
t325109
 
5.9%
c323923
 
5.9%
l308131
 
5.6%
p271411
 
4.9%
Other values (17)1242179
22.6%
Common
ValueCountFrequency (%)
411577
84.4%
_59376
 
12.2%
&16978
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII5992837
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o843846
14.1%
e758399
12.7%
411577
 
6.9%
r383542
 
6.4%
S366564
 
6.1%
h340901
 
5.7%
d340901
 
5.7%
t325109
 
5.4%
c323923
 
5.4%
l308131
 
5.1%
Other values (20)1589944
26.5%

reporting
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 24.5 MiB
School: 257258
Non-School: 86320
NO_LABEL: 56699

Length

Max length: 10
Median length: 6
Mean length: 7.145901463
Min length: 6
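
Because reporting has only three distinct values, its length statistics can be reproduced from the value counts alone. The sketch below assumes that length means the number of characters per value; it also recovers the report's total character count for this variable (2860340).

```python
from statistics import median

# Value counts copied from the report's Common Values table for `reporting`.
REPORTING_COUNTS = {"School": 257258, "Non-School": 86320, "NO_LABEL": 56699}

n_rows = sum(REPORTING_COUNTS.values())
total_chars = sum(len(value) * count for value, count in REPORTING_COUNTS.items())
mean_length = total_chars / n_rows

# Expand to one length per row so the median can be taken directly.
lengths = [len(value) for value, count in REPORTING_COUNTS.items() for _ in range(count)]
median_length = median(lengths)

print(n_rows, total_chars, round(mean_length, 6))
print(min(lengths), median_length, max(lengths))
```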

Characters and Unicode

Total characters: 2860340
Distinct characters: 14
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: School
2nd row: NO_LABEL
3rd row: School
4th row: School
5th row: School

Common Values

Value       Count    Frequency (%)
School      257258   64.3%
Non-School  86320    21.6%
NO_LABEL    56699    14.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
school257258
64.3%
non-school86320
 
21.6%
no_label56699
 
14.2%

Most occurring characters

ValueCountFrequency (%)
o773476
27.0%
S343578
12.0%
c343578
12.0%
h343578
12.0%
l343578
12.0%
N143019
 
5.0%
L113398
 
4.0%
n86320
 
3.0%
-86320
 
3.0%
O56699
 
2.0%
Other values (4)226796
 
7.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1890530
66.1%
Uppercase Letter826791
28.9%
Dash Punctuation86320
 
3.0%
Connector Punctuation56699
 
2.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S343578
41.6%
N143019
17.3%
L113398
 
13.7%
O56699
 
6.9%
A56699
 
6.9%
B56699
 
6.9%
E56699
 
6.9%
Lowercase Letter
ValueCountFrequency (%)
o773476
40.9%
c343578
18.2%
h343578
18.2%
l343578
18.2%
n86320
 
4.6%
Connector Punctuation
ValueCountFrequency (%)
_56699
100.0%
Dash Punctuation
ValueCountFrequency (%)
-86320
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2717321
95.0%
Common143019
 
5.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o773476
28.5%
S343578
12.6%
c343578
12.6%
h343578
12.6%
l343578
12.6%
N143019
 
5.3%
L113398
 
4.2%
n86320
 
3.2%
O56699
 
2.1%
A56699
 
2.1%
Other values (2)113398
 
4.2%
Common
ValueCountFrequency (%)
-86320
60.4%
_56699
39.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII2860340
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o773476
27.0%
S343578
12.0%
c343578
12.0%
h343578
12.0%
l343578
12.0%
N143019
 
5.0%
L113398
 
4.0%
n86320
 
3.0%
-86320
 
3.0%
O56699
 
2.0%
Other values (4)226796
 
7.9%

student_type
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 9
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 25.7 MiB
Unspecified: 223026
NO_LABEL: 99871
Special Education: 42024
Poverty: 17845
ELL: 6752
Other values (4): 10759

Length

Max length: 17
Median length: 11
Mean length: 10.41966438
Min length: 3

Characters and Unicode

Total characters: 4170752
Distinct characters: 31
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: NO_LABEL
2nd row: NO_LABEL
3rd row: Unspecified
4th row: Unspecified
5th row: Unspecified

Common Values

Value              Count    Frequency (%)
Unspecified        223026   55.7%
NO_LABEL           99871    25.0%
Special Education  42024    10.5%
Poverty            17845    4.5%
ELL                6752     1.7%
PreK               5561     1.4%
At Risk            3132     0.8%
Gifted             1595     0.4%
Alternative        471      0.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
unspecified223026
50.1%
no_label99871
22.4%
special42024
 
9.4%
education42024
 
9.4%
poverty17845
 
4.0%
ell6752
 
1.5%
prek5561
 
1.2%
at3132
 
0.7%
risk3132
 
0.7%
gifted1595
 
0.4%

Most occurring characters

ValueCountFrequency (%)
i535298
12.8%
e514019
12.3%
c307074
 
7.4%
d266645
 
6.4%
n265521
 
6.4%
p265050
 
6.4%
s226158
 
5.4%
f224621
 
5.4%
U223026
 
5.3%
L213246
 
5.1%
Other values (21)1130094
27.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2962001
71.0%
Uppercase Letter1063724
 
25.5%
Connector Punctuation99871
 
2.4%
Space Separator45156
 
1.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i535298
18.1%
e514019
17.4%
c307074
10.4%
d266645
9.0%
n265521
9.0%
p265050
8.9%
s226158
7.6%
f224621
7.6%
a84519
 
2.9%
t65538
 
2.2%
Other values (7)207558
 
7.0%
Uppercase Letter
ValueCountFrequency (%)
U223026
21.0%
L213246
20.0%
E148647
14.0%
A103474
9.7%
N99871
9.4%
O99871
9.4%
B99871
9.4%
S42024
 
4.0%
P23406
 
2.2%
K5561
 
0.5%
Other values (2)4727
 
0.4%
Connector Punctuation
ValueCountFrequency (%)
_99871
100.0%
Space Separator
ValueCountFrequency (%)
45156
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4025725
96.5%
Common145027
 
3.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
i535298
13.3%
e514019
12.8%
c307074
 
7.6%
d266645
 
6.6%
n265521
 
6.6%
p265050
 
6.6%
s226158
 
5.6%
f224621
 
5.6%
U223026
 
5.5%
L213246
 
5.3%
Other values (19)985067
24.5%
Common
ValueCountFrequency (%)
_99871
68.9%
45156
31.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII4170752
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i535298
12.8%
e514019
12.3%
c307074
 
7.4%
d266645
 
6.4%
n265521
 
6.4%
p265050
 
6.4%
s226158
 
5.4%
f224621
 
5.4%
U223026
 
5.3%
L213246
 
5.1%
Other values (21)1130094
27.1%

position_type
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 25
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 25.1 MiB
Teacher: 102788
NO_LABEL: 97607
Substitute: 63515
Other: 37614
TA: 22799
Other values (20): 75954

Length

Max length: 23
Median length: 8
Mean length: 8.687486416
Min length: 2

Characters and Unicode

Total characters: 3477401
Distinct characters: 44
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Teacher
2nd row: NO_LABEL
3rd row: Teacher
4th row: Substitute
5th row: Teacher

Common Values

Value                  Count    Frequency (%)
Teacher                102788   25.7%
NO_LABEL               97607    24.4%
Substitute             63515    15.9%
Other                  37614    9.4%
TA                     22799    5.7%
Non-Position           20996    5.2%
Custodian              9713     2.4%
Sec/Clerk/Other Admin  8814     2.2%
Coordinator/Manager    7407     1.9%
Instructional Coach    5148     1.3%
Other values (15)      23876    6.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
teacher102788
23.9%
no_label97607
22.7%
substitute63515
14.8%
other37614
 
8.7%
ta22799
 
5.3%
non-position20996
 
4.9%
custodian9713
 
2.3%
sec/clerk/other8814
 
2.0%
admin8814
 
2.0%
coordinator/manager7407
 
1.7%
Other values (24)50366
11.7%

Most occurring characters

ValueCountFrequency (%)
e362433
 
10.4%
t294837
 
8.5%
r213887
 
6.2%
L197189
 
5.7%
i170030
 
4.9%
h166880
 
4.8%
a166039
 
4.8%
u155841
 
4.5%
c147152
 
4.2%
O144707
 
4.2%
Other values (34)1458406
41.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2206040
63.4%
Uppercase Letter1090651
31.4%
Connector Punctuation97607
 
2.8%
Other Punctuation30781
 
0.9%
Space Separator30156
 
0.9%
Dash Punctuation20996
 
0.6%
Open Punctuation585
 
< 0.1%
Close Punctuation585
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e362433
16.4%
t294837
13.4%
r213887
9.7%
i170030
7.7%
h166880
7.6%
a166039
7.5%
u155841
7.1%
c147152
6.7%
o134127
 
6.1%
s114663
 
5.2%
Other values (12)280151
12.7%
Uppercase Letter
ValueCountFrequency (%)
L197189
18.1%
O144707
13.3%
A133286
12.2%
T128384
11.8%
N120571
11.1%
E98192
9.0%
B97607
8.9%
S79827
7.3%
C41581
 
3.8%
P27421
 
2.5%
Other values (6)21886
 
2.0%
Connector Punctuation
ValueCountFrequency (%)
_97607
100.0%
Dash Punctuation
ValueCountFrequency (%)
-20996
100.0%
Other Punctuation
ValueCountFrequency (%)
/30781
100.0%
Space Separator
ValueCountFrequency (%)
30156
100.0%
Open Punctuation
ValueCountFrequency (%)
(585
100.0%
Close Punctuation
ValueCountFrequency (%)
)585
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin3296691
94.8%
Common180710
 
5.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e362433
 
11.0%
t294837
 
8.9%
r213887
 
6.5%
L197189
 
6.0%
i170030
 
5.2%
h166880
 
5.1%
a166039
 
5.0%
u155841
 
4.7%
c147152
 
4.5%
O144707
 
4.4%
Other values (28)1277696
38.8%
Common
ValueCountFrequency (%)
_97607
54.0%
/30781
 
17.0%
30156
 
16.7%
-20996
 
11.6%
(585
 
0.3%
)585
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII3477401
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e362433
 
10.4%
t294837
 
8.5%
r213887
 
6.2%
L197189
 
5.7%
i170030
 
4.9%
h166880
 
4.8%
a166039
 
4.8%
u155841
 
4.5%
c147152
 
4.2%
O144707
 
4.2%
Other values (34)1458406
41.9%

object_type
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 11
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 MiB
Base Salary/Compensation: 97670
Benefits: 85467
NO_LABEL: 69644
Other Compensation/Stipend: 61685
Supplies/Materials: 31935
Other values (6): 53876

Length

Max length: 27
Median length: 20
Mean length: 17.33844563
Min length: 8

Characters and Unicode

Total characters: 6940181
Distinct characters: 37
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: NO_LABEL
2nd row: NO_LABEL
3rd row: Base Salary/Compensation
4th row: Benefits
5th row: Substitute Compensation

Common Values

Value                        Count   Frequency (%)
Base Salary/Compensation     97670   24.4%
Benefits                     85467   21.4%
NO_LABEL                     69644   17.4%
Other Compensation/Stipend   61685   15.4%
Supplies/Materials           31935   8.0%
Substitute Compensation      27357   6.8%
Contracted Services          7512    1.9%
Other Non-Compensation       6297    1.6%
Travel & Conferences         5030    1.3%
Equipment & Equipment Lease  4460    1.1%

Length

Histogram of lengths of the category
Value                 Count   Frequency (%)
salary/compensation   97670   15.6%
base                  97670   15.6%
benefits              85467   13.7%
no_label              69644   11.2%
other                 67982   10.9%
compensation/stipend  61685   9.9%
supplies/materials    31935   5.1%
compensation          27357   4.4%
substitute            27357   4.4%
&                     9490    1.5%
Other values (8)      47981   7.7%

Most occurring characters

ValueCountFrequency (%)
e749443
 
10.8%
n569179
 
8.2%
a566891
 
8.2%
t555753
 
8.0%
s487595
 
7.0%
i457480
 
6.6%
o404857
 
5.8%
p327484
 
4.7%
B252781
 
3.6%
S226159
 
3.3%
Other values (27)2342559
33.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter5202860
75.0%
Uppercase Letter1233419
 
17.8%
Space Separator223961
 
3.2%
Other Punctuation204000
 
2.9%
Connector Punctuation69644
 
1.0%
Dash Punctuation6297
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e749443
14.4%
n569179
10.9%
a566891
10.9%
t555753
10.7%
s487595
9.4%
i457480
8.8%
o404857
7.8%
p327484
6.3%
r222671
 
4.3%
m201929
 
3.9%
Other values (10)659578
12.7%
Uppercase Letter
ValueCountFrequency (%)
B252781
20.5%
S226159
18.3%
C205551
16.7%
L143748
11.7%
O137626
11.2%
E78564
 
6.4%
N75941
 
6.2%
A69644
 
5.6%
M31935
 
2.6%
T5030
 
0.4%
Other values (2)6440
 
0.5%
Other Punctuation
ValueCountFrequency (%)
/194510
95.3%
&9490
 
4.7%
Connector Punctuation
ValueCountFrequency (%)
_69644
100.0%
Space Separator
ValueCountFrequency (%)
223961
100.0%
Dash Punctuation
ValueCountFrequency (%)
-6297
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin6436279
92.7%
Common503902
 
7.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e749443
11.6%
n569179
 
8.8%
a566891
 
8.8%
t555753
 
8.6%
s487595
 
7.6%
i457480
 
7.1%
o404857
 
6.3%
p327484
 
5.1%
B252781
 
3.9%
S226159
 
3.5%
Other values (22)1838657
28.6%
Common
ValueCountFrequency (%)
223961
44.4%
/194510
38.6%
_69644
 
13.8%
&9490
 
1.9%
-6297
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII6940181
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e749443
 
10.8%
n569179
 
8.2%
a566891
 
8.2%
t555753
 
8.0%
s487595
 
7.0%
i457480
 
6.6%
o404857
 
5.8%
p327484
 
4.7%
B252781
 
3.6%
S226159
 
3.3%
Other values (27)2342559
33.8%

pre_k
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 24.8 MiB
NO_LABEL: 306425
Non PreK: 81069
PreK: 12783

Length

Max length: 8
Median length: 8
Mean length: 7.872258461
Min length: 4

Characters and Unicode

Total characters: 3151084
Distinct characters: 14
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: NO_LABEL
2nd row: NO_LABEL
3rd row: Non PreK
4th row: NO_LABEL
5th row: NO_LABEL

Common Values

Value     Count    Frequency (%)
NO_LABEL  306425   76.6%
Non PreK  81069    20.3%
PreK      12783    3.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
no_label306425
63.7%
prek93852
 
19.5%
non81069
 
16.8%

Most occurring characters

ValueCountFrequency (%)
L612850
19.4%
N387494
12.3%
O306425
9.7%
_306425
9.7%
A306425
9.7%
B306425
9.7%
E306425
9.7%
P93852
 
3.0%
r93852
 
3.0%
e93852
 
3.0%
Other values (4)337059
10.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter2413748
76.6%
Lowercase Letter349842
 
11.1%
Connector Punctuation306425
 
9.7%
Space Separator81069
 
2.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
L612850
25.4%
N387494
16.1%
O306425
12.7%
A306425
12.7%
B306425
12.7%
E306425
12.7%
P93852
 
3.9%
K93852
 
3.9%
Lowercase Letter
ValueCountFrequency (%)
r93852
26.8%
e93852
26.8%
o81069
23.2%
n81069
23.2%
Connector Punctuation
ValueCountFrequency (%)
_306425
100.0%
Space Separator
ValueCountFrequency (%)
81069
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2763590
87.7%
Common387494
 
12.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
L612850
22.2%
N387494
14.0%
O306425
11.1%
A306425
11.1%
B306425
11.1%
E306425
11.1%
P93852
 
3.4%
r93852
 
3.4%
e93852
 
3.4%
K93852
 
3.4%
Other values (2)162138
 
5.9%
Common
ValueCountFrequency (%)
_306425
79.1%
81069
 
20.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII3151084
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
L612850
19.4%
N387494
12.3%
O306425
9.7%
_306425
9.7%
A306425
9.7%
B306425
9.7%
E306425
9.7%
P93852
 
3.0%
r93852
 
3.0%
e93852
 
3.0%
Other values (4)337059
10.7%

operating_status
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.1 MiB
PreK-12 Operating: 343578
Non-Operating: 48034
Operating, Not PreK-12: 8665

Length

Max length: 22
Median length: 17
Mean length: 16.62822995
Min length: 13

Characters and Unicode

Total characters: 6655898
Distinct characters: 18
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: PreK-12 Operating
2nd row: Non-Operating
3rd row: PreK-12 Operating
4th row: PreK-12 Operating
5th row: PreK-12 Operating

Common Values

Value                   Count    Frequency (%)
PreK-12 Operating       343578   85.8%
Non-Operating           48034    12.0%
Operating, Not PreK-12  8665     2.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
prek-12352243
46.3%
operating352243
46.3%
non-operating48034
 
6.3%
not8665
 
1.1%

Most occurring characters

ValueCountFrequency (%)
r752520
 
11.3%
e752520
 
11.3%
n448311
 
6.7%
t408942
 
6.1%
-400277
 
6.0%
O400277
 
6.0%
p400277
 
6.0%
a400277
 
6.0%
i400277
 
6.0%
g400277
 
6.0%
Other values (8)1891943
28.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4020100
60.4%
Uppercase Letter1161462
 
17.5%
Decimal Number704486
 
10.6%
Dash Punctuation400277
 
6.0%
Space Separator360908
 
5.4%
Other Punctuation8665
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r752520
18.7%
e752520
18.7%
n448311
11.2%
t408942
10.2%
p400277
10.0%
a400277
10.0%
i400277
10.0%
g400277
10.0%
o56699
 
1.4%
Uppercase Letter
ValueCountFrequency (%)
O400277
34.5%
P352243
30.3%
K352243
30.3%
N56699
 
4.9%
Decimal Number
ValueCountFrequency (%)
1352243
50.0%
2352243
50.0%
Dash Punctuation
ValueCountFrequency (%)
-400277
100.0%
Space Separator
ValueCountFrequency (%)
360908
100.0%
Other Punctuation
ValueCountFrequency (%)
,8665
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin5181562
77.8%
Common1474336
 
22.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
r752520
14.5%
e752520
14.5%
n448311
8.7%
t408942
7.9%
O400277
7.7%
p400277
7.7%
a400277
7.7%
i400277
7.7%
g400277
7.7%
P352243
6.8%
Other values (3)465641
9.0%
Common
ValueCountFrequency (%)
-400277
27.1%
360908
24.5%
1352243
23.9%
2352243
23.9%
,8665
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII6655898
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r752520
 
11.3%
e752520
 
11.3%
n448311
 
6.7%
t408942
 
6.1%
-400277
 
6.0%
O400277
 
6.0%
p400277
 
6.0%
a400277
 
6.0%
i400277
 
6.0%
g400277
 
6.0%
Other values (8)1891943
28.4%

object_description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 602
Distinct (%): 0.2%
Missing: 24784
Missing (%): 6.2%
Memory size: 30.8 MiB

EMPLOYEE BENEFITS: 47495
SALARIES OF PART TIME EMPLOYEE: 31761
SALARIES OF REGULAR EMPLOYEES: 24319
CONTRA BENEFITS: 19381
Salaries And Wages For Teachers And Other Professi: 18632
Other values (597): 233905

Length

Max length: 73
Median length: 29
Mean length: 26.82298205
Min length: 3

Characters and Unicode

Total characters: 10071842
Distinct characters: 64
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
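
As an illustration of the character properties mentioned above, the following is a minimal sketch that counts characters per Unicode general category with Python's standard `unicodedata` module. The helper name `category_counts`, the `CATEGORY_NAMES` mapping, and the three sample strings (taken from values that appear in this report) are assumptions made for the example; this is not necessarily how the report itself was computed.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the long names used in this report.
# Only the categories that occur in this section are listed (illustrative mapping).
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category across an iterable of strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Hypothetical sample: three category values that appear in this report.
sample = ["CONTRA BENEFITS", "Regular *", "BASIC (FEFP K-12)"]
counts = category_counts(sample)

for name, count in counts.most_common():
    print(f"{name}: {count}")
```

Applied to a whole column (skipping missing values), the same loop would yield a table shaped like "Most occurring categories". Script and block properties are not exposed by `unicodedata`, so they would need a different source.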

Unique

Unique: 86
Unique (%): < 0.1%

Sample

1st row: CONTRACTOR SERVICES
2nd row: Personal Services - Teachers
3rd row: EMPLOYEE BENEFITS
4th row: TEACHER COVERAGE FOR TEACHER
5th row: CONTRA BENEFITS

Common Values

Value | Count | Frequency (%)
EMPLOYEE BENEFITS | 47495 | 11.9%
SALARIES OF PART TIME EMPLOYEE | 31761 | 7.9%
SALARIES OF REGULAR EMPLOYEES | 24319 | 6.1%
CONTRA BENEFITS | 19381 | 4.8%
Salaries And Wages For Teachers And Other Professi | 18632 | 4.7%
ADDITIONAL/EXTRA DUTY PAY/STIP | 16841 | 4.2%
SUPPLIES | 13117 | 3.3%
RETIREMENT CONTRIB. | 13073 | 3.3%
Regular * | 9270 | 2.3%
Extra Duty Pay/Overtime For Support Personnel | 9159 | 2.3%
Other values (592) | 172445 | 43.1%
(Missing) | 24784 | 6.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
salaries93875
 
7.2%
employee80757
 
6.2%
and70269
 
5.4%
benefits67030
 
5.1%
of60248
 
4.6%
services50224
 
3.9%
other50087
 
3.8%
48398
 
3.7%
for45562
 
3.5%
personal37148
 
2.8%
Other values (589)700458
53.7%

Most occurring characters

ValueCountFrequency (%)
1365605
 
13.6%
E884727
 
8.8%
S520193
 
5.2%
e489994
 
4.9%
A430218
 
4.3%
T428553
 
4.3%
O409871
 
4.1%
P392805
 
3.9%
R380175
 
3.8%
I358396
 
3.6%
Other values (54)4411305
43.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter5569093
55.3%
Lowercase Letter3012107
29.9%
Space Separator1365605
 
13.6%
Other Punctuation75841
 
0.8%
Dash Punctuation46294
 
0.5%
Open Punctuation1525
 
< 0.1%
Close Punctuation1377
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E884727
15.9%
S520193
9.3%
A430218
 
7.7%
T428553
 
7.7%
O409871
 
7.4%
P392805
 
7.1%
R380175
 
6.8%
I358396
 
6.4%
L289600
 
5.2%
N240267
 
4.3%
Other values (16)1234288
22.2%
Lowercase Letter
ValueCountFrequency (%)
e489994
16.3%
r354534
11.8%
s327823
10.9%
a279242
9.3%
i200166
 
6.6%
t184497
 
6.1%
n180517
 
6.0%
o179841
 
6.0%
l148807
 
4.9%
c101452
 
3.4%
Other values (16)565234
18.8%
Other Punctuation
ValueCountFrequency (%)
/46236
61.0%
*14598
 
19.2%
.13150
 
17.3%
,1320
 
1.7%
&361
 
0.5%
"120
 
0.2%
'56
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
-46291
> 99.9%
3
 
< 0.1%
Space Separator
ValueCountFrequency (%)
1365605
100.0%
Open Punctuation
ValueCountFrequency (%)
(1525
100.0%
Close Punctuation
ValueCountFrequency (%)
)1377
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin8581200
85.2%
Common1490642
 
14.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
E884727
 
10.3%
S520193
 
6.1%
e489994
 
5.7%
A430218
 
5.0%
T428553
 
5.0%
O409871
 
4.8%
P392805
 
4.6%
R380175
 
4.4%
I358396
 
4.2%
r354534
 
4.1%
Other values (42)3931734
45.8%
Common
ValueCountFrequency (%)
1365605
91.6%
-46291
 
3.1%
/46236
 
3.1%
*14598
 
1.0%
.13150
 
0.9%
(1525
 
0.1%
)1377
 
0.1%
,1320
 
0.1%
&361
 
< 0.1%
"120
 
< 0.1%
Other values (2)59
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII10071839
> 99.9%
Punctuation3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1365605
 
13.6%
E884727
 
8.8%
S520193
 
5.2%
e489994
 
4.9%
A430218
 
4.3%
T428553
 
4.3%
O409871
 
4.1%
P392805
 
3.9%
R380175
 
3.8%
I358396
 
3.6%
Other values (53)4411302
43.8%
Punctuation
ValueCountFrequency (%)
3
100.0%

text_2
Categorical

HIGH CARDINALITY
MISSING

Distinct: 301
Distinct (%): 0.3%
Missing: 312060
Missing (%): 78.0%
Memory size: 15.7 MiB

TEACHER SUBS: 16599
FOOD SERVICES: 5871
GENERAL EDUCATION: 4251
TRANSPORTATION: 3945
CUSTODIAL-SCHOOLS: 3860
Other values (296): 53691

Length

Max length: 48
Median length: 15
Mean length: 16.59347971
Min length: 2

Characters and Unicode

Total characters: 1463827
Distinct characters: 55
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 42
Unique (%): < 0.1%

Sample

1st row: BOND EXPENDITURES
2nd row: TEACHER SUBS
3rd row: TEACHER SUBS
4th row: SPECIAL EDUCATION INSTRUCTION
5th row: TEACHER SUBS

Common Values

Value | Count | Frequency (%)
TEACHER SUBS | 16599 | 4.1%
FOOD SERVICES | 5871 | 1.5%
GENERAL EDUCATION | 4251 | 1.1%
TRANSPORTATION | 3945 | 1.0%
CUSTODIAL-SCHOOLS | 3860 | 1.0%
TEACHER LEARNING & LEADERSHIP | 2878 | 0.7%
TEACHER | 2710 | 0.7%
SEVERE DISABILITIES | 2610 | 0.7%
MAINTENANCE | 2483 | 0.6%
AFTERSCHOOL PROGRAMS | 2264 | 0.6%
Other values (291) | 40746 | 10.2%
(Missing) | 312060 | 78.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
teacher23073
 
12.5%
subs16599
 
9.0%
services9958
 
5.4%
education8799
 
4.8%
food5871
 
3.2%
5544
 
3.0%
general4372
 
2.4%
transportation3953
 
2.1%
custodial-schools3860
 
2.1%
leadership3149
 
1.7%
Other values (353)99101
53.8%

Most occurring characters

ValueCountFrequency (%)
E179986
12.3%
S135019
 
9.2%
A119477
 
8.2%
T108714
 
7.4%
R98044
 
6.7%
I96667
 
6.6%
96069
 
6.6%
C88085
 
6.0%
O83624
 
5.7%
N75142
 
5.1%
Other values (45)383000
26.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter1346416
92.0%
Space Separator96069
 
6.6%
Other Punctuation9328
 
0.6%
Lowercase Letter6423
 
0.4%
Dash Punctuation5591
 
0.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E179986
13.4%
S135019
10.0%
A119477
 
8.9%
T108714
 
8.1%
R98044
 
7.3%
I96667
 
7.2%
C88085
 
6.5%
O83624
 
6.2%
N75142
 
5.6%
L54036
 
4.0%
Other values (16)307622
22.8%
Lowercase Letter
ValueCountFrequency (%)
a791
12.3%
e690
10.7%
t688
10.7%
r633
9.9%
l530
8.3%
n418
 
6.5%
i403
 
6.3%
p357
 
5.6%
s319
 
5.0%
o301
 
4.7%
Other values (12)1293
20.1%
Other Punctuation
ValueCountFrequency (%)
&4656
49.9%
/4475
48.0%
,136
 
1.5%
"33
 
0.4%
.28
 
0.3%
Space Separator
ValueCountFrequency (%)
96069
100.0%
Dash Punctuation
ValueCountFrequency (%)
-5591
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1352839
92.4%
Common110988
 
7.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
E179986
13.3%
S135019
10.0%
A119477
 
8.8%
T108714
 
8.0%
R98044
 
7.2%
I96667
 
7.1%
C88085
 
6.5%
O83624
 
6.2%
N75142
 
5.6%
L54036
 
4.0%
Other values (38)314045
23.2%
Common
ValueCountFrequency (%)
96069
86.6%
-5591
 
5.0%
&4656
 
4.2%
/4475
 
4.0%
,136
 
0.1%
"33
 
< 0.1%
.28
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1463827
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E179986
12.3%
S135019
 
9.2%
A119477
 
8.2%
T108714
 
7.4%
R98044
 
6.7%
I96667
 
6.6%
96069
 
6.6%
C88085
 
6.0%
O83624
 
5.7%
N75142
 
5.1%
Other values (45)383000
26.2%

subfund_description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 274
Distinct (%): 0.1%
Missing: 93422
Missing (%): 23.3%
Memory size: 24.5 MiB

GENERAL FUND: 123327
Operations: 26895
FEDERAL GDPG FUND - FY: 13562
Support Services - Instructional Staff: 10238
Special Instruction: 10106
Other values (269): 122727

Length

Max length: 50
Median length: 12
Mean length: 17.03792997
Min length: 4

Characters and Unicode

Total characters: 5228174
Distinct characters: 70
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): < 0.1%

Sample

1st row: BUILDING FUND
2nd row: GENERAL FUND
3rd row: GENERAL FUND
4th row: GENERAL FUND
5th row: LOCAL

Common Values

Value | Count | Frequency (%)
GENERAL FUND | 123327 | 30.8%
Operations | 26895 | 6.7%
FEDERAL GDPG FUND - FY | 13562 | 3.4%
Support Services - Instructional Staff | 10238 | 2.6%
Special Instruction | 10106 | 2.5%
DISTRICT SPECIAL REVENUE FUNDS | 9975 | 2.5%
LOCAL | 9393 | 2.3%
ARRA - STIMULUS | 7458 | 1.9%
Community Services | 7386 | 1.8%
MILL LEVY | 5964 | 1.5%
Other values (264) | 82551 | 20.6%
(Missing) | 93422 | 23.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
fund151371
19.1%
general125177
 
15.8%
41423
 
5.2%
operations27772
 
3.5%
services23501
 
3.0%
special21061
 
2.7%
gdpg17243
 
2.2%
federal16118
 
2.0%
support15101
 
1.9%
instruction14921
 
1.9%
Other values (399)337329
42.6%

Most occurring characters

ValueCountFrequency (%)
516118
 
9.9%
E434569
 
8.3%
N328926
 
6.3%
R272175
 
5.2%
A256011
 
4.9%
L249433
 
4.8%
D243639
 
4.7%
F204672
 
3.9%
U204066
 
3.9%
G193454
 
3.7%
Other values (60)2325111
44.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3242006
62.0%
Lowercase Letter1389207
26.6%
Space Separator516118
 
9.9%
Dash Punctuation51380
 
1.0%
Other Punctuation19764
 
0.4%
Decimal Number9661
 
0.2%
Open Punctuation19
 
< 0.1%
Close Punctuation19
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E434569
13.4%
N328926
10.1%
R272175
8.4%
A256011
 
7.9%
L249433
 
7.7%
D243639
 
7.5%
F204672
 
6.3%
U204066
 
6.3%
G193454
 
6.0%
S180841
 
5.6%
Other values (15)674220
20.8%
Lowercase Letter
ValueCountFrequency (%)
t162150
11.7%
i146056
10.5%
r136214
9.8%
e124502
9.0%
n121925
8.8%
o104972
7.6%
a101491
7.3%
s97704
7.0%
p80290
 
5.8%
c75016
 
5.4%
Other values (15)238887
17.2%
Other Punctuation
ValueCountFrequency (%)
"11140
56.4%
.2645
 
13.4%
,2591
 
13.1%
&2385
 
12.1%
*681
 
3.4%
'228
 
1.2%
/88
 
0.4%
!6
 
< 0.1%
Decimal Number
Value | Count | Frequency (%)
9 | 2007 | 20.8%
1 | 2007 | 20.8%
2 | 2007 | 20.8%
8 | 1652 | 17.1%
6 | 1484 | 15.4%
0 | 336 | 3.5%
4 | 168 | 1.7%
Dash Punctuation
ValueCountFrequency (%)
-51350
99.9%
30
 
0.1%
Space Separator
ValueCountFrequency (%)
516118
100.0%
Open Punctuation
ValueCountFrequency (%)
(19
100.0%
Close Punctuation
ValueCountFrequency (%)
)19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4631213
88.6%
Common596961
 
11.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
E434569
 
9.4%
N328926
 
7.1%
R272175
 
5.9%
A256011
 
5.5%
L249433
 
5.4%
D243639
 
5.3%
F204672
 
4.4%
U204066
 
4.4%
G193454
 
4.2%
S180841
 
3.9%
Other values (40)2063427
44.6%
Common
ValueCountFrequency (%)
516118
86.5%
-51350
 
8.6%
"11140
 
1.9%
.2645
 
0.4%
,2591
 
0.4%
&2385
 
0.4%
92007
 
0.3%
12007
 
0.3%
22007
 
0.3%
81652
 
0.3%
Other values (10)3059
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII5228144
> 99.9%
Punctuation30
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
516118
 
9.9%
E434569
 
8.3%
N328926
 
6.3%
R272175
 
5.2%
A256011
 
4.9%
L249433
 
4.8%
D243639
 
4.7%
F204672
 
3.9%
U204066
 
3.9%
G193454
 
3.7%
Other values (59)2325081
44.5%
Punctuation
ValueCountFrequency (%)
30
100.0%

job_title_description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 3516
Distinct (%): 1.2%
Missing: 107534
Missing (%): 26.9%
Memory size: 25.3 MiB

Teacher, Elementary: 30939
Teacher, Short Term Sub: 23450
(blank): 15235
Teacher, Secondary (High): 8994
Teacher,Retrd Shrt Term Sub: 8746
Other values (3511): 205379

Length

Max length: 49
Median length: 23
Mean length: 21.9477289
Min length: 5

Characters and Unicode

Total characters: 6425044
Distinct characters: 75
Distinct categories: 9
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1119
Unique (%): 0.4%

Sample

1st row: Teacher-Elementary
2nd row: (blank)
3rd row: TCHER 2ND GRADE
4th row: Teacher, Short Term Sub
5th row: Teacher, Secondary (High)

Common Values

Value | Count | Frequency (%)
Teacher, Elementary | 30939 | 7.7%
Teacher, Short Term Sub | 23450 | 5.9%
(blank) | 15235 | 3.8%
Teacher, Secondary (High) | 8994 | 2.2%
Teacher,Retrd Shrt Term Sub | 8746 | 2.2%
TEACHER, REGULAR | 8517 | 2.1%
TEACHER SUBSTITUTE POOL | 7115 | 1.8%
SUB TEACHER ALL | 7018 | 1.8%
Teacher Secondary (Middle) | 6912 | 1.7%
Teacher | 5033 | 1.3%
Other values (3506) | 170784 | 42.7%
(Missing) | 107534 | 26.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
teacher133249
 
16.7%
sub45914
 
5.7%
elementary34140
 
4.3%
term33030
 
4.1%
short23450
 
2.9%
secondary18525
 
2.3%
blank15235
 
1.9%
regular11947
 
1.5%
high10844
 
1.4%
ed10474
 
1.3%
Other values (2015)462723
57.9%

Most occurring characters

ValueCountFrequency (%)
1079682
 
16.8%
e498866
 
7.8%
r363205
 
5.7%
T312003
 
4.9%
E292648
 
4.6%
a274708
 
4.3%
S261747
 
4.1%
c198484
 
3.1%
A184997
 
2.9%
h184859
 
2.9%
Other values (65)2773845
43.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2802939
43.6%
Uppercase Letter2260570
35.2%
Space Separator1079682
 
16.8%
Other Punctuation176125
 
2.7%
Open Punctuation39109
 
0.6%
Close Punctuation37846
 
0.6%
Dash Punctuation24415
 
0.4%
Decimal Number4357
 
0.1%
Math Symbol1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T312003
13.8%
E292648
12.9%
S261747
11.6%
A184997
 
8.2%
R176243
 
7.8%
I142774
 
6.3%
C141209
 
6.2%
L96235
 
4.3%
H92817
 
4.1%
O86062
 
3.8%
Other values (16)473835
21.0%
Lowercase Letter
ValueCountFrequency (%)
e498866
17.8%
r363205
13.0%
a274708
9.8%
c198484
 
7.1%
h184859
 
6.6%
t165851
 
5.9%
n142971
 
5.1%
o140081
 
5.0%
i119285
 
4.3%
l118893
 
4.2%
Other values (16)595736
21.3%
Decimal Number
Value | Count | Frequency (%)
8 | 1353 | 31.1%
0 | 664 | 15.2%
7 | 627 | 14.4%
2 | 372 | 8.5%
1 | 366 | 8.4%
3 | 248 | 5.7%
5 | 242 | 5.6%
4 | 232 | 5.3%
6 | 223 | 5.1%
9 | 30 | 0.7%
Other Punctuation
ValueCountFrequency (%)
,153435
87.1%
/10935
 
6.2%
.9279
 
5.3%
&2237
 
1.3%
'215
 
0.1%
!17
 
< 0.1%
:6
 
< 0.1%
%1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
-24415
100.0%
Space Separator
ValueCountFrequency (%)
1079682
100.0%
Open Punctuation
ValueCountFrequency (%)
(39109
100.0%
Close Punctuation
ValueCountFrequency (%)
)37846
100.0%
Math Symbol
ValueCountFrequency (%)
+1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin5063509
78.8%
Common1361535
 
21.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e498866
 
9.9%
r363205
 
7.2%
T312003
 
6.2%
E292648
 
5.8%
a274708
 
5.4%
S261747
 
5.2%
c198484
 
3.9%
A184997
 
3.7%
h184859
 
3.7%
R176243
 
3.5%
Other values (42)2315749
45.7%
Common
ValueCountFrequency (%)
1079682
79.3%
,153435
 
11.3%
(39109
 
2.9%
)37846
 
2.8%
-24415
 
1.8%
/10935
 
0.8%
.9279
 
0.7%
&2237
 
0.2%
81353
 
0.1%
0664
 
< 0.1%
Other values (13)2580
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII6425044
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1079682
 
16.8%
e498866
 
7.8%
r363205
 
5.7%
T312003
 
4.9%
E292648
 
4.6%
a274708
 
4.3%
S261747
 
4.1%
c198484
 
3.1%
A184997
 
2.9%
h184859
 
2.9%
Other values (65)2773845
43.2%

text_3
Categorical

HIGH CORRELATION
MISSING

Distinct: 34
Distinct (%): < 0.1%
Missing: 291125
Missing (%): 72.7%
Memory size: 15.6 MiB

Regular: 94462
Turnaround: 7755
Alternative: 5299
Charter: 664
New/Closed Schl: 623
Other values (29): 349

Length

Max length: 39
Median length: 7
Mean length: 7.502638522
Min length: 4

Characters and Unicode

Total characters: 818928
Distinct characters: 47
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 12
Unique (%): < 0.1%

Sample

1st row: Regular
2nd row: Regular
3rd row: Alternative
4th row: Regular
5th row: Regular

Common Values

Value | Count | Frequency (%)
Regular | 94462 | 23.6%
Turnaround | 7755 | 1.9%
Alternative | 5299 | 1.3%
Charter | 664 | 0.2%
New/Closed Schl | 623 | 0.2%
Other Materials and Supplies | 60 | < 0.1%
Supplies for Instruction | 42 | < 0.1%
Employee Travel | 41 | < 0.1%
Staff Development | 39 | < 0.1%
Professional and Contract Services | 36 | < 0.1%
Other values (24) | 131 | < 0.1%
(Missing) | 291125 | 72.7%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
regular94462
85.5%
turnaround7755
 
7.0%
alternative5299
 
4.8%
charter664
 
0.6%
new/closed623
 
0.6%
schl623
 
0.6%
and118
 
0.1%
supplies103
 
0.1%
other89
 
0.1%
for76
 
0.1%
Other values (56)638
 
0.6%

Most occurring characters

ValueCountFrequency (%)
r117114
14.3%
u110189
13.5%
a108682
13.3%
e107793
13.2%
l101387
12.4%
R94494
11.5%
g94469
11.5%
n21298
 
2.6%
t11831
 
1.4%
o8849
 
1.1%
Other values (37)42822
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter706099
86.2%
Uppercase Letter110905
 
13.5%
Space Separator1298
 
0.2%
Other Punctuation626
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r117114
16.6%
u110189
15.6%
a108682
15.4%
e107793
15.3%
l101387
14.4%
g94469
13.4%
n21298
 
3.0%
t11831
 
1.7%
o8849
 
1.3%
d8535
 
1.2%
Other values (15)15952
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
R94494
85.2%
T7831
 
7.1%
A5311
 
4.8%
C1360
 
1.2%
S837
 
0.8%
N624
 
0.6%
O117
 
0.1%
I87
 
0.1%
E85
 
0.1%
M65
 
0.1%
Other values (8)94
 
0.1%
Other Punctuation
ValueCountFrequency (%)
/624
99.7%
,1
 
0.2%
'1
 
0.2%
Space Separator
ValueCountFrequency (%)
1298
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin817004
99.8%
Common1924
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
r117114
14.3%
u110189
13.5%
a108682
13.3%
e107793
13.2%
l101387
12.4%
R94494
11.6%
g94469
11.6%
n21298
 
2.6%
t11831
 
1.4%
o8849
 
1.1%
Other values (33)40898
 
5.0%
Common
ValueCountFrequency (%)
1298
67.5%
/624
32.4%
,1
 
0.1%
'1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII818928
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r117114
14.3%
u110189
13.5%
a108682
13.3%
e107793
13.2%
l101387
12.4%
R94494
11.5%
g94469
11.5%
n21298
 
2.6%
t11831
 
1.4%
o8849
 
1.1%
Other values (37)42822
 
5.2%

text_4
Categorical

HIGH CARDINALITY
MISSING

Distinct: 244
Distinct (%): 0.5%
Missing: 346531
Missing (%): 86.6%
Memory size: 14.8 MiB

Regular Instruction: 10762
Basic Educational Services - District Objective: 9481
Undistributed: 7106
Special Education Instruction: 2892
Regular Salary: 2157
Other values (239): 21348

Length

Max length: 60
Median length: 19
Mean length: 24.85384959
Min length: 3

Characters and Unicode

Total characters: 1335795
Distinct characters: 66
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 24
Unique (%): < 0.1%

Sample

1st row: Regular Instruction
2nd row: Regular Instruction
3rd row: Regular Instruction
4th row: SIG - ARRA
5th row: Transportation - Bus Drivers

Common Values

Value | Count | Frequency (%)
Regular Instruction | 10762 | 2.7%
Basic Educational Services - District Objective | 9481 | 2.4%
Undistributed | 7106 | 1.8%
Special Education Instruction | 2892 | 0.7%
Regular Salary | 2157 | 0.5%
Regular Instructional Support | 1899 | 0.5%
transportation - Second Runs | 1852 | 0.5%
Transportation - Bus Drivers | 1257 | 0.3%
Office of Principal | 1072 | 0.3%
IDEA Part B | 1063 | 0.3%
Other values (234) | 14205 | 3.5%
(Missing) | 346531 | 86.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
16353
 
9.7%
regular14911
 
8.8%
instruction14752
 
8.8%
services9638
 
5.7%
basic9485
 
5.6%
district9481
 
5.6%
objective9481
 
5.6%
educational9481
 
5.6%
undistributed7106
 
4.2%
transportation4162
 
2.5%
Other values (368)63641
37.8%

Most occurring characters

ValueCountFrequency (%)
127520
 
9.5%
t121010
 
9.1%
i114904
 
8.6%
r92099
 
6.9%
e91572
 
6.9%
a84431
 
6.3%
c84309
 
6.3%
n76289
 
5.7%
s69691
 
5.2%
u62987
 
4.7%
Other values (56)410983
30.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1035551
77.5%
Uppercase Letter152781
 
11.4%
Space Separator127520
 
9.5%
Dash Punctuation16036
 
1.2%
Other Punctuation2522
 
0.2%
Open Punctuation503
 
< 0.1%
Close Punctuation503
 
< 0.1%
Decimal Number379
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t121010
11.7%
i114904
11.1%
r92099
8.9%
e91572
8.8%
a84431
8.2%
c84309
8.1%
n76289
7.4%
s69691
 
6.7%
u62987
 
6.1%
o55052
 
5.3%
Other values (17)183207
17.7%
Uppercase Letter
ValueCountFrequency (%)
S23552
15.4%
I21607
14.1%
R18822
12.3%
E16994
11.1%
D13957
9.1%
B12822
8.4%
O11791
7.7%
T7679
 
5.0%
U7123
 
4.7%
P6682
 
4.4%
Other values (16)11752
7.7%
Other Punctuation
ValueCountFrequency (%)
&817
32.4%
,704
27.9%
.516
20.5%
/437
17.3%
'48
 
1.9%
Decimal Number
Value | Count | Frequency (%)
9 | 121 | 31.9%
7 | 86 | 22.7%
2 | 86 | 22.7%
0 | 86 | 22.7%
Space Separator
ValueCountFrequency (%)
127520
100.0%
Dash Punctuation
ValueCountFrequency (%)
-16036
100.0%
Open Punctuation
ValueCountFrequency (%)
(503
100.0%
Close Punctuation
ValueCountFrequency (%)
)503
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1188332
89.0%
Common147463
 
11.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
t121010
 
10.2%
i114904
 
9.7%
r92099
 
7.8%
e91572
 
7.7%
a84431
 
7.1%
c84309
 
7.1%
n76289
 
6.4%
s69691
 
5.9%
u62987
 
5.3%
o55052
 
4.6%
Other values (43)335988
28.3%
Common
ValueCountFrequency (%)
127520
86.5%
-16036
 
10.9%
&817
 
0.6%
,704
 
0.5%
.516
 
0.3%
(503
 
0.3%
)503
 
0.3%
/437
 
0.3%
9121
 
0.1%
786
 
0.1%
Other values (3)220
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1335723
> 99.9%
Latin 1 Sup36
 
< 0.1%
Latin Ext B36
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
127520
 
9.5%
t121010
 
9.1%
i114904
 
8.6%
r92099
 
6.9%
e91572
 
6.9%
a84431
 
6.3%
c84309
 
6.3%
n76289
 
5.7%
s69691
 
5.2%
u62987
 
4.7%
Other values (53)410911
30.8%
Latin 1 Sup
ValueCountFrequency (%)
Ã18
50.0%
Â18
50.0%
Latin Ext B
ValueCountFrequency (%)
ƒ36
100.0%

sub_object_description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 182
Distinct (%): 0.2%
Missing: 308674
Missing (%): 77.1%
Memory size: 17.2 MiB

Extra Duty Pay/Overtime For Support Personnel: 9159
Certificated Employees Salaries And Wages: 8720
Salaries And Wages For Teachers And Other Professi: 7285
Salaries And Wages For Substitute Teachers: 6327
General Supplies *: 5829
Other values (177): 54283

Length

Max length: 62
Median length: 37
Mean length: 31.6275777
Min length: 3

Characters and Unicode

Total characters: 2897181
Distinct characters: 61
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): < 0.1%

Sample

1st row: Equipment *
2nd row: Certificated Employees Salaries And Wages
3rd row: Salaries And Wages For Substitute Teachers
4th row: Travel Mileage/Meeting Expense *
5th row: Purchased Services

Common Values

Value | Count | Frequency (%)
Extra Duty Pay/Overtime For Support Personnel | 9159 | 2.3%
Certificated Employees Salaries And Wages | 8720 | 2.2%
Salaries And Wages For Teachers And Other Professi | 7285 | 1.8%
Salaries And Wages For Substitute Teachers | 6327 | 1.6%
General Supplies * | 5829 | 1.5%
Salaries Or Wages For Support Personnel | 4824 | 1.2%
Supplies And Materials | 3195 | 0.8%
Extended Day | 3045 | 0.8%
Extra Duty Wages | 2730 | 0.7%
Purchased Services | 2534 | 0.6%
Other values (172) | 37955 | 9.5%
(Missing) | 308674 | 77.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
and45090
 
11.0%
wages34216
 
8.3%
salaries31486
 
7.7%
for28718
 
7.0%
24804
 
6.0%
personnel15484
 
3.8%
support15484
 
3.8%
teachers13911
 
3.4%
supplies12958
 
3.2%
extra12084
 
2.9%
Other values (265)176923
43.0%

Most occurring characters

ValueCountFrequency (%)
e350961
 
12.1%
320599
 
11.1%
r220795
 
7.6%
a209049
 
7.2%
s189968
 
6.6%
t150081
 
5.2%
i144138
 
5.0%
n128855
 
4.4%
l109893
 
3.8%
o107972
 
3.7%
Other values (51)964870
33.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2132840
73.6%
Uppercase Letter399495
 
13.8%
Space Separator320599
 
11.1%
Other Punctuation31659
 
1.1%
Dash Punctuation7208
 
0.2%
Open Punctuation2695
 
0.1%
Close Punctuation2211
 
0.1%
Math Symbol474
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e350961
16.5%
r220795
10.4%
a209049
9.8%
s189968
8.9%
t150081
 
7.0%
i144138
 
6.8%
n128855
 
6.0%
l109893
 
5.2%
o107972
 
5.1%
d82904
 
3.9%
Other values (16)438224
20.5%
Uppercase Letter
ValueCountFrequency (%)
S80190
20.1%
A47795
12.0%
P44063
11.0%
W37050
9.3%
E33371
8.4%
F31429
 
7.9%
O27892
 
7.0%
T27426
 
6.9%
C18882
 
4.7%
D15549
 
3.9%
Other values (15)35848
9.0%
Other Punctuation
ValueCountFrequency (%)
*18351
58.0%
/12013
37.9%
,1163
 
3.7%
&76
 
0.2%
"56
 
0.2%
Space Separator
ValueCountFrequency (%)
320599
100.0%
Dash Punctuation
ValueCountFrequency (%)
-7208
100.0%
Open Punctuation
ValueCountFrequency (%)
(2695
100.0%
Close Punctuation
ValueCountFrequency (%)
)2211
100.0%
Math Symbol
ValueCountFrequency (%)
>474
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2532335
87.4%
Common364846
 
12.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e350961
13.9%
r220795
 
8.7%
a209049
 
8.3%
s189968
 
7.5%
t150081
 
5.9%
i144138
 
5.7%
n128855
 
5.1%
l109893
 
4.3%
o107972
 
4.3%
d82904
 
3.3%
Other values (41)837719
33.1%
Common
ValueCountFrequency (%)
320599
87.9%
*18351
 
5.0%
/12013
 
3.3%
-7208
 
2.0%
(2695
 
0.7%
)2211
 
0.6%
,1163
 
0.3%
>474
 
0.1%
&76
 
< 0.1%
"56
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII2897181
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e350961
 
12.1%
320599
 
11.1%
r220795
 
7.6%
a209049
 
7.2%
s189968
 
6.6%
t150081
 
5.2%
i144138
 
5.0%
n128855
 
4.4%
l109893
 
3.8%
o107972
 
3.7%
Other values (51)964870
33.3%

location_description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 354
Distinct (%): 0.2%
Missing: 238223
Missing (%): 59.5%
Memory size: 18.3 MiB

School: 65524
ADMIN. SERVICES: 7749
Unallocated: 6005
SPECIAL EDUCATION: 5485
Undistributed: 5309
Other values (349): 71982

Length

Max length: 42
Median length: 11
Mean length: 14.47584139
Min length: 3

Characters and Unicode

Total characters: 2345868
Distinct characters: 56
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): < 0.1%

Sample

1st row: DISTRICT WIDE ORGANIZATION UNI
2nd row: CHARTER
3rd row: School
4th row: School
5th row: SAFETY AND SECURITY DIVISION

Common Values

Value | Count | Frequency (%)
School | 65524 | 16.4%
ADMIN. SERVICES | 7749 | 1.9%
Unallocated | 6005 | 1.5%
SPECIAL EDUCATION | 5485 | 1.4%
Undistributed | 5309 | 1.3%
District Wide Resources | 3626 | 0.9%
OPPORTUNITY SCHOOL | 3044 | 0.8%
TRANSPORTATION | 2788 | 0.7%
TEACHER LEARNING & LEADERSHIP | 2482 | 0.6%
Summer School | 2339 | 0.6%
Other values (344) | 57703 | 14.4%
(Missing) | 238223 | 59.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
school75382
25.4%
services13381
 
4.5%
11518
 
3.9%
education10636
 
3.6%
admin7768
 
2.6%
unallocated6005
 
2.0%
special5574
 
1.9%
undistributed5309
 
1.8%
and4978
 
1.7%
resources4968
 
1.7%
Other values (412)150749
50.9%

Most occurring characters

ValueCountFrequency (%)
299665
 
12.8%
S179107
 
7.6%
E163283
 
7.0%
o150316
 
6.4%
I119289
 
5.1%
A109923
 
4.7%
T101724
 
4.3%
R101187
 
4.3%
O98425
 
4.2%
N96555
 
4.1%
Other values (46)926394
39.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter1436369
61.2%
Lowercase Letter582148
24.8%
Space Separator299665
 
12.8%
Other Punctuation20677
 
0.9%
Dash Punctuation6844
 
0.3%
Open Punctuation79
 
< 0.1%
Close Punctuation73
 
< 0.1%
Decimal Number13
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S179107
12.5%
E163283
11.4%
I119289
 
8.3%
A109923
 
7.7%
T101724
 
7.1%
R101187
 
7.0%
O98425
 
6.9%
N96555
 
6.7%
C92910
 
6.5%
D64201
 
4.5%
Other values (16)309765
21.6%
Lowercase Letter
ValueCountFrequency (%)
o150316
25.8%
c82472
14.2%
l80557
13.8%
h67910
11.7%
t28622
 
4.9%
i27489
 
4.7%
e27338
 
4.7%
d22455
 
3.9%
r19333
 
3.3%
s18570
 
3.2%
Other values (10)57086
 
9.8%
Other Punctuation
ValueCountFrequency (%)
.8208
39.7%
&7343
35.5%
/4958
24.0%
,129
 
0.6%
'39
 
0.2%
Space Separator
ValueCountFrequency (%)
299665
100.0%
Dash Punctuation
ValueCountFrequency (%)
-6844
100.0%
Open Punctuation
ValueCountFrequency (%)
(79
100.0%
Close Punctuation
ValueCountFrequency (%)
)73
100.0%
Decimal Number
Value | Count | Frequency (%)
8 | 13 | 100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2018517
86.0%
Common327351
 
14.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S179107
 
8.9%
E163283
 
8.1%
o150316
 
7.4%
I119289
 
5.9%
A109923
 
5.4%
T101724
 
5.0%
R101187
 
5.0%
O98425
 
4.9%
N96555
 
4.8%
C92910
 
4.6%
Other values (36)805798
39.9%
Common
ValueCountFrequency (%)
299665
91.5%
.8208
 
2.5%
&7343
 
2.2%
-6844
 
2.1%
/4958
 
1.5%
,129
 
< 0.1%
(79
 
< 0.1%
)73
 
< 0.1%
'39
 
< 0.1%
813
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII2345868
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
299665
 
12.8%
S179107
 
7.6%
E163283
 
7.0%
o150316
 
6.4%
I119289
 
5.1%
A109923
 
4.7%
T101724
 
4.3%
R101187
 
4.3%
O98425
 
4.2%
N96555
 
4.1%
Other values (46)926394
39.5%

fte
Real number (ℝ)

HIGH CORRELATION
MISSING
ZEROS

Distinct: 21003
Distinct (%): 16.7%
Missing: 274206
Missing (%): 68.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.4267939847
Minimum: -0.08755063657
Maximum: 46.8
Zeros: 31338
Zeros (%): 7.8%
Negative: 51
Negative (%): < 0.1%
Memory size: 3.1 MiB
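
The percentages in this summary can be reproduced from the counts. The sketch below is a rough check, not the report's own code: it assumes that Missing, Zeros and Negative are taken relative to all 400277 observations, while Distinct is taken relative to the non-missing values only. That reading is inferred from the figures above rather than stated in the report. The helper `pct` is a hypothetical name.

```python
# Counts copied from the report: overview (observations) and the fte summary.
n_rows = 400277
missing = 274206
distinct = 21003
zeros = 31338
negative = 51

# Assumption: Distinct (%) uses the non-missing count as its denominator.
non_missing = n_rows - missing


def pct(part, whole):
    """Percentage rounded to one decimal place, matching the report's display."""
    return round(100 * part / whole, 1)


missing_pct = pct(missing, n_rows)        # share of all rows that are missing
zeros_pct = pct(zeros, n_rows)            # share of all rows equal to zero
distinct_pct = pct(distinct, non_missing) # share of non-missing values that are distinct
negative_pct = 100 * negative / n_rows    # left unrounded; the report shows "< 0.1%"

print(missing_pct, zeros_pct, distinct_pct, negative_pct)
```

Under these assumptions the computed values agree with the Missing (%), Zeros (%), Distinct (%) and Negative (%) entries listed for fte.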

Quantile statistics

Minimum: -0.08755063657
5-th percentile: 0
Q1: 0.0007918519478
median: 0.1309267241
Q3: 1
95-th percentile: 1
Maximum: 46.8
Range: 46.88755064
Interquartile range (IQR): 0.9992081481

Descriptive statistics

Standard deviation: 0.5735755013
Coefficient of variation (CV): 1.343916554
Kurtosis: 1172.951126
Mean: 0.4267939847
Median Absolute Deviation (MAD): 0.1309267241
Skewness: 19.27369781
Sum: 53806.34445
Variance: 0.3289888556
Monotonicity: Not monotonic
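
Several of the figures above are derived from the others, which gives a quick way to sanity-check the parsed numbers. The following is a minimal sketch using only the values reported for fte and the standard definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared). The variable names are illustrative.

```python
# Values copied from the fte quantile and descriptive statistics.
minimum, maximum = -0.08755063657, 46.8
q1, q3 = 0.0007918519478, 1.0
mean, std = 0.4267939847, 0.5735755013
total = 53806.34445

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
n_values = round(total / mean)    # Implied number of non-missing values (Sum / Mean)

print(f"range={value_range:.8f} iqr={iqr:.10f} cv={cv:.9f}")
print(f"variance={variance:.10f} n={n_values}")
```

The implied count of non-missing values matches 400277 observations minus 274206 missing, which suggests the Sum and Mean rows were read correctly.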
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 35788 | 8.9%
0 | 31338 | 7.8%
0.004310344828 | 8130 | 2.0%
0.002155172414 | 2735 | 0.7%
0.008620689655 | 2293 | 0.6%
0.025 | 2020 | 0.5%
0.5 | 1825 | 0.5%
0.01293103448 | 941 | 0.2%
0.006465517241 | 712 | 0.2%
0.01724137931 | 479 | 0.1%
Other values (20993) | 39810 | 9.9%
(Missing) | 274206 | 68.5%
Value | Count | Frequency (%)
-0.08755063657 | 1 | < 0.1%
-0.08333574388 | 1 | < 0.1%
-0.06793578943 | 1 | < 0.1%
-0.0612244898 | 1 | < 0.1%
-0.05434779326 | 1 | < 0.1%
-0.04167114006 | 1 | < 0.1%
-0.03886915615 | 1 | < 0.1%
-0.038298 | 1 | < 0.1%
-0.03042857143 | 1 | < 0.1%
-0.03020339672 | 1 | < 0.1%
Value | Count | Frequency (%)
46.8 | 1 | < 0.1%
45.6 | 1 | < 0.1%
40.2 | 1 | < 0.1%
34.2 | 1 | < 0.1%
31.1 | 1 | < 0.1%
27 | 1 | < 0.1%
24.6 | 1 | < 0.1%
23.1 | 1 | < 0.1%
22.8 | 1 | < 0.1%
22.5 | 1 | < 0.1%

function_description
Categorical

HIGH CARDINALITY
MISSING

Distinct: 687
Distinct (%): 0.2%
Missing: 58082
Missing (%): 14.5%
Memory size: 27.1 MiB

NON-PROJECT: 76890
Instruction: 27008
UNALLOC BUDGETS/SCHOOLS: 16616
BASIC (FEFP K-12): 13317
EMPLOYEE RETIREMENT: 13073
Other values (682): 195291

Length

Max length64
Median length19
Mean length20.49312819
Min length3

Characters and Unicode

Total characters7012646
Distinct characters66
Distinct categories9
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
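
As a rough illustration of how such character properties can be tallied, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the report itself may compute these figures differently.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the values that appears in function_description.
counts = category_counts("BASIC (FEFP K-12)")

# Two-letter codes correspond to the long names used in this report:
# Lu = Uppercase Letter, Zs = Space Separator, Nd = Decimal Number,
# Pd = Dash Punctuation, Ps = Open Punctuation, Pe = Close Punctuation.
for code, count in counts.most_common():
    print(code, count)
```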

Unique

Unique91
Unique (%)< 0.1%

Sample

1st rowRGN GOB
2nd rowUNALLOC BUDGETS/SCHOOLS
3rd rowNON-PROJECT
4th rowNON-PROJECT
5th rowNON-PROJECT

Common Values

ValueCountFrequency (%)
NON-PROJECT76890
19.2%
Instruction27008
 
6.7%
UNALLOC BUDGETS/SCHOOLS16616
 
4.2%
BASIC (FEFP K-12) 13317
 
3.3%
EMPLOYEE RETIREMENT13073
 
3.3%
INSTRUCTION12924
 
3.2%
Disadvantaged Youth *7814
 
2.0%
ELA E-TEACHING SHELTERED ENG6311
 
1.6%
Instruction And Curriculum Development Services *5235
 
1.3%
INST STAFF TRAINING SVCS 4957
 
1.2%
Other values (677)158050
39.5%
(Missing)58082
 
14.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
non-project76890
 
9.0%
61809
 
7.2%
instruction47488
 
5.5%
services41125
 
4.8%
and24465
 
2.9%
unalloc16616
 
1.9%
budgets/schools16616
 
1.9%
title14774
 
1.7%
k-1213949
 
1.6%
basic13393
 
1.6%
Other values (814)528738
61.8%

Most occurring characters

ValueCountFrequency (%)
941095
 
13.4%
E470468
 
6.7%
N369821
 
5.3%
O354388
 
5.1%
T347269
 
5.0%
I297030
 
4.2%
S267232
 
3.8%
R263198
 
3.8%
C262678
 
3.7%
A244096
 
3.5%
Other values (56)3195371
45.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3948389
56.3%
Lowercase Letter1840944
26.3%
Space Separator941095
 
13.4%
Dash Punctuation140629
 
2.0%
Other Punctuation72422
 
1.0%
Decimal Number27901
 
0.4%
Open Punctuation20628
 
0.3%
Close Punctuation20628
 
0.3%
Math Symbol10
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E470468
11.9%
N369821
9.4%
O354388
 
9.0%
T347269
 
8.8%
I297030
 
7.5%
S267232
 
6.8%
R263198
 
6.7%
C262678
 
6.7%
A244096
 
6.2%
P181367
 
4.6%
Other values (16)890842
22.6%
Lowercase Letter
ValueCountFrequency (%)
n193033
10.5%
i191438
10.4%
t184381
10.0%
e175948
9.6%
r160226
8.7%
o129867
 
7.1%
c126318
 
6.9%
u115187
 
6.3%
s113062
 
6.1%
a101084
 
5.5%
Other values (15)350400
19.0%
Other Punctuation
ValueCountFrequency (%)
*35211
48.6%
/21129
29.2%
.6342
 
8.8%
"3366
 
4.6%
,3268
 
4.5%
&3094
 
4.3%
'10
 
< 0.1%
:2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
213951
50.0%
113950
50.0%
Space Separator
ValueCountFrequency (%)
941095
100.0%
Dash Punctuation
ValueCountFrequency (%)
-140629
100.0%
Open Punctuation
ValueCountFrequency (%)
(20628
100.0%
Close Punctuation
ValueCountFrequency (%)
)20628
100.0%
Math Symbol
ValueCountFrequency (%)
+10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin5789333
82.6%
Common1223313
 
17.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
E470468
 
8.1%
N369821
 
6.4%
O354388
 
6.1%
T347269
 
6.0%
I297030
 
5.1%
S267232
 
4.6%
R263198
 
4.5%
C262678
 
4.5%
A244096
 
4.2%
n193033
 
3.3%
Other values (41)2720120
47.0%
Common
ValueCountFrequency (%)
941095
76.9%
-140629
 
11.5%
*35211
 
2.9%
/21129
 
1.7%
(20628
 
1.7%
)20628
 
1.7%
213951
 
1.1%
113950
 
1.1%
.6342
 
0.5%
"3366
 
0.3%
Other values (5)6384
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII7012646
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
941095
 
13.4%
E470468
 
6.7%
N369821
 
5.3%
O354388
 
5.1%
T347269
 
5.0%
I297030
 
4.2%
S267232
 
3.8%
R263198
 
3.8%
C262678
 
3.7%
A244096
 
3.5%
Other values (56)3195371
45.6%

facility_or_department
Categorical

HIGH CARDINALITY
MISSING

Distinct179
Distinct (%)0.3%
Missing346391
Missing (%)86.5%
Memory size14.6 MiB
All Campus Payroll
17697 
Instruction And Curriculum
13815 
Transportation Department
5542 
Child Nutrition
3405 
Custodial Department
2173 
Other values (174)
11254 

Length

Max length43
Median length20
Mean length21.48090413
Min length3

Characters and Unicode

Total characters1157520
Distinct characters57
Distinct categories7
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26
Unique (%)< 0.1%

Sample

1st rowPOSITION CONTROL POOLS
2nd rowAll Campus Payroll
3rd rowTransportation Department
4th rowTransportation Department
5th rowCustodial Department

Common Values

ValueCountFrequency (%)
All Campus Payroll17697
 
4.4%
Instruction And Curriculum13815
 
3.5%
Transportation Department5542
 
1.4%
Child Nutrition3405
 
0.9%
Custodial Department2173
 
0.5%
Athletic Department773
 
0.2%
Instruction and Curriculum722
 
0.2%
Finance Department474
 
0.1%
WRITING TEAMS 438
 
0.1%
Performing Arts Department384
 
0.1%
Other values (169)8463
 
2.1%
(Missing)346391
86.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
campus17954
12.5%
payroll17790
12.4%
all17697
12.3%
and15455
10.8%
curriculum14790
10.3%
instruction14736
10.3%
department10606
7.4%
transportation5587
 
3.9%
child3405
 
2.4%
nutrition3405
 
2.4%
Other values (239)21897
15.3%

Most occurring characters

ValueCountFrequency (%)
111157
 
9.6%
l93994
 
8.1%
r91679
 
7.9%
u83982
 
7.3%
t77268
 
6.7%
n76135
 
6.6%
a65557
 
5.7%
i54770
 
4.7%
o53606
 
4.6%
m45396
 
3.9%
Other values (47)403976
34.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter832016
71.9%
Uppercase Letter212420
 
18.4%
Space Separator111157
 
9.6%
Other Punctuation1500
 
0.1%
Dash Punctuation170
 
< 0.1%
Open Punctuation138
 
< 0.1%
Close Punctuation119
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l93994
11.3%
r91679
11.0%
u83982
10.1%
t77268
9.3%
n76135
9.2%
a65557
7.9%
i54770
 
6.6%
o53606
 
6.4%
m45396
 
5.5%
s43965
 
5.3%
Other values (14)145664
17.5%
Uppercase Letter
ValueCountFrequency (%)
C44693
21.0%
A40815
19.2%
I22390
10.5%
P20951
9.9%
T14321
 
6.7%
D12853
 
6.1%
N9299
 
4.4%
E8844
 
4.2%
S8152
 
3.8%
R5614
 
2.6%
Other values (13)24488
11.5%
Other Punctuation
ValueCountFrequency (%)
"817
54.5%
,405
27.0%
'78
 
5.2%
&74
 
4.9%
/70
 
4.7%
:56
 
3.7%
Space Separator
ValueCountFrequency (%)
111157
100.0%
Open Punctuation
ValueCountFrequency (%)
(138
100.0%
Close Punctuation
ValueCountFrequency (%)
)119
100.0%
Dash Punctuation
ValueCountFrequency (%)
-170
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1044436
90.2%
Common113084
 
9.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
l93994
 
9.0%
r91679
 
8.8%
u83982
 
8.0%
t77268
 
7.4%
n76135
 
7.3%
a65557
 
6.3%
i54770
 
5.2%
o53606
 
5.1%
m45396
 
4.3%
C44693
 
4.3%
Other values (37)357356
34.2%
Common
ValueCountFrequency (%)
111157
98.3%
"817
 
0.7%
,405
 
0.4%
-170
 
0.2%
(138
 
0.1%
)119
 
0.1%
'78
 
0.1%
&74
 
0.1%
/70
 
0.1%
:56
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1157520
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111157
 
9.6%
l93994
 
8.1%
r91679
 
7.9%
u83982
 
7.3%
t77268
 
6.7%
n76135
 
6.6%
a65557
 
5.7%
i54770
 
4.7%
o53606
 
4.6%
m45396
 
3.9%
Other values (47)403976
34.9%

position_extra
Categorical

HIGH CARDINALITY
MISSING

Distinct580
Distinct (%)0.2%
Missing135513
Missing (%)33.9%
Memory size23.6 MiB
PROFESSIONAL-INSTRUCTIONAL
92136 
UNDESIGNATED
48273 
CRAFTS, TRADES, AND SERVICES
13015 
PARAPROFESSIONAL
12639 
TEACHER BACHELOR
11875 
Other values (575)
86826 

Length

Max length40
Median length20
Mean length20.01482075
Min length4

Characters and Unicode

Total characters5299204
Distinct characters59
Distinct categories8
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67
Unique (%)< 0.1%

Sample

1st rowKINDERGARTEN
2nd rowUNDESIGNATED
3rd rowTEACHER
4th rowPROFESSIONAL-INSTRUCTIONAL
5th rowPROFESSIONAL-INSTRUCTIONAL

Common Values

ValueCountFrequency (%)
PROFESSIONAL-INSTRUCTIONAL92136
23.0%
UNDESIGNATED48273
 
12.1%
CRAFTS, TRADES, AND SERVICES13015
 
3.3%
PARAPROFESSIONAL12639
 
3.2%
TEACHER BACHELOR11875
 
3.0%
SUBSTITUTE TEACHER9622
 
2.4%
OFFICE/ADMINISTRATIVE SUPPORT6437
 
1.6%
PROFESSIONAL-OTHER5842
 
1.5%
TEACHER MASTER5114
 
1.3%
BUS DRIVER4962
 
1.2%
Other values (570)54849
13.7%
(Missing)135513
33.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
professional-instructional92136
21.5%
undesignated48273
 
11.3%
teacher32469
 
7.6%
substitute17448
 
4.1%
and13901
 
3.2%
services13499
 
3.2%
trades13015
 
3.0%
crafts13015
 
3.0%
paraprofessional12639
 
3.0%
bachelor11875
 
2.8%
Other values (502)160074
37.4%

Most occurring characters

ValueCountFrequency (%)
S524512
 
9.9%
I468374
 
8.8%
E458880
 
8.7%
N442619
 
8.4%
A436945
 
8.2%
T422069
 
8.0%
R395264
 
7.5%
O381348
 
7.2%
L241346
 
4.6%
C215182
 
4.1%
Other values (49)1312665
24.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter4908200
92.6%
Space Separator197480
 
3.7%
Dash Punctuation102364
 
1.9%
Lowercase Letter50966
 
1.0%
Other Punctuation38320
 
0.7%
Decimal Number1668
 
< 0.1%
Open Punctuation103
 
< 0.1%
Close Punctuation103
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S524512
10.7%
I468374
9.5%
E458880
9.3%
N442619
9.0%
A436945
8.9%
T422069
8.6%
R395264
8.1%
O381348
7.8%
L241346
 
4.9%
C215182
 
4.4%
Other values (16)921661
18.8%
Lowercase Letter
ValueCountFrequency (%)
d11780
23.1%
e8060
15.8%
r7866
15.4%
a5972
11.7%
l4560
 
8.9%
n4030
 
7.9%
t2307
 
4.5%
i1919
 
3.8%
x1917
 
3.8%
g1917
 
3.8%
Other values (8)638
 
1.3%
Decimal Number
ValueCountFrequency (%)
2484
29.0%
3318
19.1%
1295
17.7%
5289
17.3%
4258
15.5%
917
 
1.0%
67
 
0.4%
Other Punctuation
ValueCountFrequency (%)
,26498
69.1%
/9119
 
23.8%
.1360
 
3.5%
&1343
 
3.5%
Space Separator
ValueCountFrequency (%)
197480
100.0%
Dash Punctuation
ValueCountFrequency (%)
-102364
100.0%
Open Punctuation
ValueCountFrequency (%)
(103
100.0%
Close Punctuation
ValueCountFrequency (%)
)103
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4959166
93.6%
Common340038
 
6.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
S524512
10.6%
I468374
9.4%
E458880
9.3%
N442619
8.9%
A436945
8.8%
T422069
8.5%
R395264
8.0%
O381348
 
7.7%
L241346
 
4.9%
C215182
 
4.3%
Other values (34)972627
19.6%
Common
ValueCountFrequency (%)
197480
58.1%
-102364
30.1%
,26498
 
7.8%
/9119
 
2.7%
.1360
 
0.4%
&1343
 
0.4%
2484
 
0.1%
3318
 
0.1%
1295
 
0.1%
5289
 
0.1%
Other values (5)488
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII5299204
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S524512
 
9.9%
I468374
 
8.8%
E458880
 
8.7%
N442619
 
8.4%
A436945
 
8.2%
T422069
 
8.0%
R395264
 
7.5%
O381348
 
7.2%
L241346
 
4.6%
C215182
 
4.1%
Other values (49)1312665
24.8%

total
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED

Distinct286430
Distinct (%)72.4%
Missing4555
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean13105.85683
Minimum-87466307.15
Maximum129699999.2
Zeros46
Zeros (%)< 0.1%
Negative43870
Negative (%)11.0%
Memory size3.1 MiB

Quantile statistics

Minimum-87466307.15
5-th percentile-706.9685
Q173.7977
median461.23
Q33652.6625
95-th percentile64813.34174
Maximum129699999.2
Range217166306.4
Interquartile range (IQR)3578.8648

Descriptive statistics

Standard deviation368225.3924
Coefficient of variation (CV)28.09624714
Kurtosis51040.80173
Mean13105.85683
Median Absolute Deviation (MAD)471.9874
Skewness100.3197995
Sum5186275876
Variance1.355899396 × 10^11
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value                   Count     Frequency (%)
0.22                    61        < 0.1%
0.12                    55        < 0.1%
-0.1                    55        < 0.1%
0.21                    54        < 0.1%
0.06                    54        < 0.1%
0.09                    52        < 0.1%
-0.03                   52        < 0.1%
0.24                    51        < 0.1%
109.9979385             51        < 0.1%
0.28                    51        < 0.1%
Other values (286420)   395186    98.7%
(Missing)               4555      1.1%

Value           Count    Frequency (%)
-87466307.15    1        < 0.1%
-47890569.54    1        < 0.1%
-26464999.51    1        < 0.1%
-23999999.99    1        < 0.1%
-23999663.97    1        < 0.1%
-22699999.68    1        < 0.1%
-12159238.06    1        < 0.1%
-6417510.63     1        < 0.1%
-5779383.85     1        < 0.1%
-5643908.4      1        < 0.1%

Value          Count    Frequency (%)
129699999.2    1        < 0.1%
53237974.89    1        < 0.1%
47890568.71    1        < 0.1%
39564458.63    1        < 0.1%
36450944.43    1        < 0.1%
28251184.28    1        < 0.1%
27623683.92    1        < 0.1%
26464999.73    1        < 0.1%
26181208.89    1        < 0.1%
25162929.66    1        < 0.1%

program_description
Categorical

HIGH CARDINALITY
MISSING

Distinct421
Distinct (%)0.1%
Missing95617
Missing (%)23.9%
Memory size26.5 MiB
GENERAL ELEMENTARY EDUCATION
32829 
EMPLOYEE BENEFITS
32669 
INSTRUCTIONAL STAFF TRAINING
21521 
Undistributed
18547 
Instruction - Regular
 
13825
Other values (416)
185269 

Length

Max length123
Median length25
Mean length24.26858137
Min length2

Characters and Unicode

Total characters7393666
Distinct characters67
Distinct categories9
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31
Unique (%)< 0.1%

Sample

1st rowKINDERGARTEN
2nd rowBUILDING IMPROVEMENT SERVICES
3rd rowInstruction - Regular
4th rowGENERAL MIDDLE/JUNIOR HIGH SCH
5th rowGENERAL HIGH SCHOOL EDUCATION

Common Values

ValueCountFrequency (%)
GENERAL ELEMENTARY EDUCATION32829
 
8.2%
EMPLOYEE BENEFITS32669
 
8.2%
INSTRUCTIONAL STAFF TRAINING21521
 
5.4%
Undistributed18547
 
4.6%
Instruction - Regular13825
 
3.5%
Misc13143
 
3.3%
GENERAL HIGH SCHOOL EDUCATION10625
 
2.7%
Basic Educational Services10593
 
2.6%
"Title I, Part A Schoolwide Activities Related To State Comp8520
 
2.1%
GENERAL MIDDLE/JUNIOR HIGH SCH7179
 
1.8%
Other values (411)135209
33.8%
(Missing)95617
23.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
education70865
 
7.7%
general60866
 
6.6%
35454
 
3.9%
elementary34249
 
3.7%
services33644
 
3.7%
benefits32669
 
3.6%
employee32669
 
3.6%
staff25257
 
2.8%
instruction23492
 
2.6%
instructional22950
 
2.5%
Other values (560)543529
59.4%

Most occurring characters

ValueCountFrequency (%)
E688744
 
9.3%
625168
 
8.5%
I416433
 
5.6%
A402471
 
5.4%
N387010
 
5.2%
R365377
 
4.9%
T359796
 
4.9%
S322943
 
4.4%
O302475
 
4.1%
L251910
 
3.4%
Other values (57)3271339
44.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter4936065
66.8%
Lowercase Letter1688779
 
22.8%
Space Separator625168
 
8.5%
Dash Punctuation66429
 
0.9%
Other Punctuation45755
 
0.6%
Decimal Number21289
 
0.3%
Open Punctuation5089
 
0.1%
Close Punctuation5089
 
0.1%
Math Symbol3
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E688744
14.0%
I416433
 
8.4%
A402471
 
8.2%
N387010
 
7.8%
R365377
 
7.4%
T359796
 
7.3%
S322943
 
6.5%
O302475
 
6.1%
L251910
 
5.1%
C230903
 
4.7%
Other values (16)1208003
24.5%
Lowercase Letter
ValueCountFrequency (%)
i219219
13.0%
t204232
12.1%
e159529
9.4%
a122628
 
7.3%
c119013
 
7.0%
s116224
 
6.9%
n111735
 
6.6%
r107082
 
6.3%
o99614
 
5.9%
u96280
 
5.7%
Other values (14)333223
19.7%
Other Punctuation
ValueCountFrequency (%)
/16567
36.2%
,12210
26.7%
"8520
18.6%
&5637
 
12.3%
'1806
 
3.9%
:561
 
1.2%
.454
 
1.0%
Decimal Number
ValueCountFrequency (%)
88353
39.2%
13667
17.2%
23667
17.2%
93156
 
14.8%
62446
 
11.5%
Space Separator
ValueCountFrequency (%)
625168
100.0%
Dash Punctuation
ValueCountFrequency (%)
-66429
100.0%
Open Punctuation
ValueCountFrequency (%)
(5089
100.0%
Close Punctuation
ValueCountFrequency (%)
)5089
100.0%
Math Symbol
ValueCountFrequency (%)
+3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin6624844
89.6%
Common768822
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
E688744
 
10.4%
I416433
 
6.3%
A402471
 
6.1%
N387010
 
5.8%
R365377
 
5.5%
T359796
 
5.4%
S322943
 
4.9%
O302475
 
4.6%
L251910
 
3.8%
C230903
 
3.5%
Other values (40)2896782
43.7%
Common
ValueCountFrequency (%)
625168
81.3%
-66429
 
8.6%
/16567
 
2.2%
,12210
 
1.6%
"8520
 
1.1%
88353
 
1.1%
&5637
 
0.7%
(5089
 
0.7%
)5089
 
0.7%
13667
 
0.5%
Other values (7)12093
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII7393666
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E688744
 
9.3%
625168
 
8.5%
I416433
 
5.6%
A402471
 
5.4%
N387010
 
5.2%
R365377
 
4.9%
T359796
 
4.9%
S322943
 
4.4%
O302475
 
4.1%
L251910
 
3.4%
Other values (57)3271339
44.2%

fund_description
Categorical

HIGH CARDINALITY
MISSING

Distinct141
Distinct (%)0.1%
Missing197400
Missing (%)49.3%
Memory size22.0 MiB
General Operating Fund
33467 
General Fund
29036 
GENERAL FUND
28176 
General Purpose School
20333 
Title I - Disadvantaged Children/Targeted Assistance
19967 
Other values (136)
71898 

Length

Max length66
Median length22
Mean length25.46656348
Min length7

Characters and Unicode

Total characters5166580
Distinct characters63
Distinct categories8
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14
Unique (%)< 0.1%

Sample

1st rowGeneral Fund
2nd rowGeneral Purpose School
3rd rowLOCAL FUND
4th rowSchool to Work
5th rowGeneral Fund

Common Values

ValueCountFrequency (%)
General Operating Fund33467
 
8.4%
General Fund29036
 
7.3%
GENERAL FUND 28176
 
7.0%
General Purpose School20333
 
5.1%
Title I - Disadvantaged Children/Targeted Assistance19967
 
5.0%
General7877
 
2.0%
"Title Part A Improving Basic Programs"6885
 
1.7%
Special Trust6335
 
1.6%
School Federal Projects5424
 
1.4%
FED THRU STATE-CASH ADVANCE 3544
 
0.9%
Other values (131)41833
 
10.5%
(Missing)197400
49.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
general118945
18.7%
fund97336
15.3%
school33665
 
5.3%
operating33467
 
5.2%
title32732
 
5.1%
22254
 
3.5%
i21656
 
3.4%
purpose20351
 
3.2%
assistance19984
 
3.1%
disadvantaged19967
 
3.1%
Other values (226)217406
34.1%

Most occurring characters

ValueCountFrequency (%)
1033092
20.0%
e440957
 
8.5%
a316666
 
6.1%
n302072
 
5.8%
r275016
 
5.3%
l215035
 
4.2%
i197068
 
3.8%
t196119
 
3.8%
d173033
 
3.3%
s154762
 
3.0%
Other values (53)1862760
36.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2966133
57.4%
Uppercase Letter1101997
 
21.3%
Space Separator1033092
 
20.0%
Other Punctuation36315
 
0.7%
Dash Punctuation26841
 
0.5%
Decimal Number1224
 
< 0.1%
Open Punctuation489
 
< 0.1%
Close Punctuation489
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G125149
11.4%
F120449
10.9%
E92594
 
8.4%
A90138
 
8.2%
T80296
 
7.3%
N77369
 
7.0%
S74681
 
6.8%
D65679
 
6.0%
P53341
 
4.8%
O51959
 
4.7%
Other values (15)270342
24.5%
Lowercase Letter
ValueCountFrequency (%)
e440957
14.9%
a316666
10.7%
n302072
10.2%
r275016
9.3%
l215035
 
7.2%
i197068
 
6.6%
t196119
 
6.6%
d173033
 
5.8%
s154762
 
5.2%
o146243
 
4.9%
Other values (15)549162
18.5%
Other Punctuation
ValueCountFrequency (%)
/19968
55.0%
"13922
38.3%
,790
 
2.2%
&674
 
1.9%
.584
 
1.6%
:375
 
1.0%
'2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1612
50.0%
2612
50.0%
Space Separator
ValueCountFrequency (%)
1033092
100.0%
Dash Punctuation
ValueCountFrequency (%)
-26841
100.0%
Open Punctuation
ValueCountFrequency (%)
(489
100.0%
Close Punctuation
ValueCountFrequency (%)
)489
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4068130
78.7%
Common1098450
 
21.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e440957
 
10.8%
a316666
 
7.8%
n302072
 
7.4%
r275016
 
6.8%
l215035
 
5.3%
i197068
 
4.8%
t196119
 
4.8%
d173033
 
4.3%
s154762
 
3.8%
o146243
 
3.6%
Other values (40)1651159
40.6%
Common
ValueCountFrequency (%)
1033092
94.0%
-26841
 
2.4%
/19968
 
1.8%
"13922
 
1.3%
,790
 
0.1%
&674
 
0.1%
1612
 
0.1%
2612
 
0.1%
.584
 
0.1%
(489
 
< 0.1%
Other values (3)866
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII5166580
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1033092
20.0%
e440957
 
8.5%
a316666
 
6.1%
n302072
 
5.8%
r275016
 
5.3%
l215035
 
4.2%
i197068
 
3.8%
t196119
 
3.8%
d173033
 
3.3%
s154762
 
3.0%
Other values (53)1862760
36.1%

text_1
Categorical

HIGH CARDINALITY
MISSING

Distinct1423
Distinct (%)0.5%
Missing107992
Missing (%)27.0%
Memory size24.4 MiB
REGULAR INSTRUCTION
64896 
EMPLOYEE BENEFITS
32669 
INSTRUCTIONAL STAFF
30592 
REGULAR PAY
18609 
SPECIAL EDUCATION
 
9778
Other values (1418)
135741 

Length

Max length45
Median length19
Mean length18.83507535
Min length1

Characters and Unicode

Total characters5505210
Distinct characters68
Distinct categories9
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138
Unique (%)< 0.1%

Sample

1st rowBUILDING IMPROVEMENT SERVICES
2nd rowREGULAR INSTRUCTION
3rd rowREGULAR INSTRUCTION
4th rowEMPLOYEE BENEFITS
5th rowUNDESIGNATED

Common Values

ValueCountFrequency (%)
REGULAR INSTRUCTION64896
16.2%
EMPLOYEE BENEFITS32669
 
8.2%
INSTRUCTIONAL STAFF30592
 
7.6%
REGULAR PAY18609
 
4.6%
SPECIAL EDUCATION9778
 
2.4%
OPERATION AND MAINT OF PLANT9161
 
2.3%
FOOD SERVICES OPERATIONS5857
 
1.5%
SCHOOL ADMINISTRATION5575
 
1.4%
STUDENTS4864
 
1.2%
CENTRAL4858
 
1.2%
Other values (1413)105426
26.3%
(Missing)107992
27.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
regular86218
 
13.0%
instruction65024
 
9.8%
employee32669
 
4.9%
benefits32669
 
4.9%
staff30747
 
4.6%
instructional30617
 
4.6%
pay18856
 
2.8%
title17787
 
2.7%
i15652
 
2.4%
school15243
 
2.3%
Other values (1695)320148
48.1%

Most occurring characters

ValueCountFrequency (%)
655466
11.9%
E506450
 
9.2%
T495636
 
9.0%
I453280
 
8.2%
N403069
 
7.3%
R393286
 
7.1%
A389688
 
7.1%
O319812
 
5.8%
S315761
 
5.7%
L273787
 
5.0%
Other values (58)1298975
23.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter4810426
87.4%
Space Separator655466
 
11.9%
Dash Punctuation18019
 
0.3%
Lowercase Letter12257
 
0.2%
Other Punctuation7873
 
0.1%
Math Symbol778
 
< 0.1%
Open Punctuation165
 
< 0.1%
Close Punctuation145
 
< 0.1%
Decimal Number81
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E506450
10.5%
T495636
10.3%
I453280
9.4%
N403069
 
8.4%
R393286
 
8.2%
A389688
 
8.1%
O319812
 
6.6%
S315761
 
6.6%
L273787
 
5.7%
U244853
 
5.1%
Other values (16)1014804
21.1%
Lowercase Letter
ValueCountFrequency (%)
t2561
20.9%
s1753
14.3%
o1653
13.5%
r988
 
8.1%
g903
 
7.4%
a695
 
5.7%
i636
 
5.2%
e630
 
5.1%
n574
 
4.7%
l340
 
2.8%
Other values (14)1524
12.4%
Other Punctuation
ValueCountFrequency (%)
/4833
61.4%
&1043
 
13.2%
,699
 
8.9%
.661
 
8.4%
:403
 
5.1%
'108
 
1.4%
@102
 
1.3%
;23
 
0.3%
?1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
339
48.1%
219
23.5%
112
 
14.8%
611
 
13.6%
Space Separator
ValueCountFrequency (%)
655466
100.0%
Dash Punctuation
ValueCountFrequency (%)
-18019
100.0%
Math Symbol
ValueCountFrequency (%)
+778
100.0%
Open Punctuation
ValueCountFrequency (%)
(165
100.0%
Close Punctuation
ValueCountFrequency (%)
)145
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4822683
87.6%
Common682527
 
12.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
E506450
10.5%
T495636
10.3%
I453280
9.4%
N403069
 
8.4%
R393286
 
8.2%
A389688
 
8.1%
O319812
 
6.6%
S315761
 
6.5%
L273787
 
5.7%
U244853
 
5.1%
Other values (40)1027061
21.3%
Common
ValueCountFrequency (%)
655466
96.0%
-18019
 
2.6%
/4833
 
0.7%
&1043
 
0.2%
+778
 
0.1%
,699
 
0.1%
.661
 
0.1%
:403
 
0.1%
(165
 
< 0.1%
)145
 
< 0.1%
Other values (8)315
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII5505210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
655466
11.9%
E506450
 
9.2%
T495636
 
9.0%
I453280
 
8.2%
N403069
 
7.3%
R393286
 
7.1%
A389688
 
7.1%
O319812
 
5.8%
S315761
 
5.7%
L273787
 
5.0%
Other values (58)1298975
23.6%

Interactions


Correlations


Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the slope of the line does not affect the magnitude of r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
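
The calculation just described can be sketched in plain Python as follows. The function name `pearson_r` and the sample data are illustrative only and are not taken from the dataset.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = pearson_r(x, y)

# Shifting and rescaling y (with a positive factor) leaves r unchanged.
r_rescaled = pearson_r(x, [3 * b + 5 for b in y])
print(r, r_rescaled)
```

Both population and sample standard deviations give the same r, provided the covariance uses the same divisor, because the divisor cancels.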

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
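
A minimal sketch of this rank-based calculation is shown below. It assumes tied values receive the average of their rank positions, which is a common convention; the function names are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# A nonlinear but strictly increasing relationship.
rho = spearman_rho([1, 2, 3, 4, 5], [1, 8, 27, 64, 125])
print(rho)
```

Because only the ordering matters, the cubic relationship in the example is treated as a perfect monotonic association.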

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
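
The pair-counting definition can be sketched directly, as below. This is the simple variant often called tau-a, in which tied pairs count as neither concordant nor discordant; the tool that produced this report may apply a tie-adjusted variant instead, so treat the function as an illustration of the formula only.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1  # both variables move in the same direction
        elif sign < 0:
            discordant += 1  # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


tau = kendall_tau_a([1, 2, 3, 4, 5], [2, 1, 4, 3, 5])
print(tau)
```

In this example 8 of the 10 pairs are concordant and 2 are discordant.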

Phik (φk)

Phik (φk) is a practical correlation coefficient that works consistently across categorical, ordinal, and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available in the Phik project documentation.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
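
A minimal sketch of a bias-corrected Cramér's V is given below. It follows the correction as commonly stated (subtracting the expected bias from φ² and shrinking the row and column counts); the function name is hypothetical and the report's own implementation may differ in detail.

```python
import math
from collections import Counter


def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    rows, cols = Counter(xs), Counter(ys)
    r, k = len(rows), len(cols)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_total in rows.items():
        for b, col_total in cols.items():
            expected = row_total * col_total / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(phi2_corr / denom)


perfect = cramers_v_corrected(["a", "b"] * 50, ["x", "y"] * 50)
independent = cramers_v_corrected(["a", "a", "b", "b"] * 25, ["x", "y", "x", "y"] * 25)
print(perfect, independent)
```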

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
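
As a rough illustration of the nullity correlation mentioned for the heatmap, the sketch below assumes it is the Pearson correlation between presence indicators (1 if a value is present, 0 if missing) of two columns. The function name and the use of `None` as the missing marker are assumptions made for this example.

```python
import math


def nullity_correlation(col_a, col_b):
    """Correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a fully present or fully missing column
    return cov / math.sqrt(var_a * var_b)


fte = [1.0, None, 0.5, None]
job_title = ["TEACHER", None, "BUS DRIVER", None]
text_4 = [None, "Regular Instruction", None, "Undistributed"]

same_pattern = nullity_correlation(fte, job_title)   # missing together
opposite_pattern = nullity_correlation(fte, text_4)  # one present when the other is missing
print(same_pattern, opposite_pattern)
```

A value near +1 means the two columns tend to be missing in the same rows; a value near -1 means one tends to be filled exactly where the other is empty.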

Sample

First rows

function  use  sharing  reporting  student_type  position_type  object_type  pre_k  operating_status  object_description  text_2  subfund_description  job_title_description  text_3  text_4  sub_object_description  location_description  fte  function_description  facility_or_department  position_extra  total  program_description  fund_description  text_1
0Teacher CompensationInstructionSchool ReportedSchoolNO_LABELTeacherNO_LABELNO_LABELPreK-12 OperatingNaNNaNNaNTeacher-ElementaryNaNNaNNaNNaN1.0NaNNaNKINDERGARTEN50471.810KINDERGARTENGeneral FundNaN
1NO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNon-OperatingCONTRACTOR SERVICESBOND EXPENDITURESBUILDING FUND(blank)RegularNaNNaNNaNNaNRGN GOBNaNUNDESIGNATED3477.860BUILDING IMPROVEMENT SERVICESNaNBUILDING IMPROVEMENT SERVICES
2Teacher CompensationInstructionSchool ReportedSchoolUnspecifiedTeacherBase Salary/CompensationNon PreKPreK-12 OperatingPersonal Services - TeachersNaNNaNTCHER 2ND GRADENaNRegular InstructionNaNNaN1.0NaNNaNTEACHER62237.130Instruction - RegularGeneral Purpose SchoolNaN
3Substitute CompensationInstructionSchool ReportedSchoolUnspecifiedSubstituteBenefitsNO_LABELPreK-12 OperatingEMPLOYEE BENEFITSTEACHER SUBSGENERAL FUNDTeacher, Short Term SubRegularNaNNaNNaNNaNUNALLOC BUDGETS/SCHOOLSNaNPROFESSIONAL-INSTRUCTIONAL22.300GENERAL MIDDLE/JUNIOR HIGH SCHNaNREGULAR INSTRUCTION
4Substitute CompensationInstructionSchool ReportedSchoolUnspecifiedTeacherSubstitute CompensationNO_LABELPreK-12 OperatingTEACHER COVERAGE FOR TEACHERTEACHER SUBSGENERAL FUNDTeacher, Secondary (High)AlternativeNaNNaNNaNNaNNON-PROJECTNaNPROFESSIONAL-INSTRUCTIONAL54.166GENERAL HIGH SCHOOL EDUCATIONNaNREGULAR INSTRUCTION
5Facilities & MaintenanceO&MSchool ReportedSchoolUnspecifiedCustodianBenefitsNO_LABELPreK-12 OperatingCONTRA BENEFITSNaNGENERAL FUNDCustodian - PT - JobsNaNNaNNaNNaNNaNNON-PROJECTNaNUNDESIGNATED-8.150EMPLOYEE BENEFITSNaNEMPLOYEE BENEFITS
6Instructional Materials & SuppliesInstructionSchool ReportedSchoolSpecial EducationNon-PositionSupplies/MaterialsNO_LABELPreK-12 OperatingEDUCATIONALSPECIAL EDUCATION INSTRUCTIONLOCALNaNNaNNaNNaNNaNNaNNaNNaNSUPPLIES AND MATERIALS2000.050SPECIAL EDUCATION LOCALLOCAL FUNDNaN
7Food ServicesO&MSchool on Central BudgetsNon-SchoolUnspecifiedCoordinator/ManagerBenefitsNO_LABELPreK-12 OperatingEMPLOYEE BENEFITSNaNGENERAL FUNDSub Manager, Food ServiceNaNNaNNaNDISTRICT WIDE ORGANIZATION UNINaNNON-PROJECTNaNUNDESIGNATED0.720UNDESIGNATEDNaNUNDESIGNATED
8Teacher CompensationInstructionSchool ReportedSchoolUnspecifiedTeacherBenefitsNO_LABELPreK-12 OperatingEMPLOYEE BENEFITSNaNGENERAL FUNDTeacher, ElementaryRegularNaNNaNNaNNaNELA S - TEACHING SPANISH ONLYNaNPROFESSIONAL-INSTRUCTIONAL228.250GENERAL ELEMENTARY EDUCATIONNaNREGULAR INSTRUCTION
9Substitute CompensationInstructionSchool ReportedSchoolUnspecifiedSubstituteBenefitsNO_LABELPreK-12 OperatingEMPLOYEE BENEFITSTEACHER SUBSGENERAL FUNDTeacher,Retrd Shrt Term SubRegularNaNNaNNaNNaNUNALLOC BUDGETS/SCHOOLSNaNPROFESSIONAL-INSTRUCTIONAL69.560GENERAL ELEMENTARY EDUCATIONNaNREGULAR INSTRUCTION

Last rows

function  use  sharing  reporting  student_type  position_type  object_type  pre_k  operating_status  object_description  text_2  subfund_description  job_title_description  text_3  text_4  sub_object_description  location_description  fte  function_description  facility_or_department  position_extra  total  program_description  fund_description  text_1
400267NO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNon-OperatingADDITIONAL/EXTRA DUTY PAY/STIPNaNBONDSecondary Spec Ed ParaTurnaroundNaNNaNNaNNaNNS GOBNaNPARAPROFESSIONAL157.654000GENERAL MIDDLE/JUNIOR HIGH SCHNaNREGULAR INSTRUCTION
400268Substitute CompensationInstructionSchool ReportedSchoolUnspecifiedSubstituteSubstitute CompensationNon PreKPreK-12 OperatingPersonal Services - Substitute Teachers CertifiedNaNNaNNaNNaNRegular InstructionNaNNaN0.00000NaNNaNSUBSTITUTE TEACHER141.939900Instruction - RegularGeneral Purpose SchoolNaN
400269Food ServicesO&MShared ServicesSchoolUnspecifiedOtherBase Salary/CompensationNon PreKPreK-12 OperatingSalaries Or Wages For Support PersonnelNaNChild NutritionCAFETERIA EMPLOYEE, FOOD SERV.NaNUndistributedSalaries Or Wages For Support PersonnelSchool0.09000Food Services (Child Nutrition Fund Only)Child NutritionREG FOOD SERVICE WORKER2293.931767UndistributedNational School Breakfast And Lunch ProgramREGULAR PAY
400270NO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNO_LABELNon-OperatingOther Awards and PrizesNaNSupport Services - Instructional StaffNaNNaNNaNAwards And Prizes *NaNNaNOther Improvements Of Instruction Services *NaNNaN-390.710000NaNMiscellaneous State GrantsSCHOOL IMPROV INCENTIVES
400271Teacher CompensationInstructionSchool on Central BudgetsNon-SchoolUnspecifiedTeacherOther Compensation/StipendNO_LABELPreK-12 OperatingADDITIONAL/EXTRA DUTY PAY/STIPMATH/SCIENCEFEDERAL GDPG FUND - FYTeacher, ElementaryNaNNaNNaNMATH / SCIENCENaNTITLE II-PART A-TEACHER QUALITNaNPROFESSIONAL-INSTRUCTIONAL283.988640INSTRUCTIONAL STAFF TRAININGNaNINSTRUCTIONAL STAFF
400272Professional DevelopmentISPDShared ServicesNon-SchoolUnspecifiedInstructional CoachOther Compensation/StipendNO_LABELPreK-12 OperatingWORKSHOP PARTICIPANTNaNNaNCURRICULUM RESOURCE TEACHERNaNNaNNaNSTAFF DEV AND INSTR MEDIANaNINST STAFF TRAINING SVCSNaNNaN48.620000NaNGENERAL FUNDSTAFF DEV AND INSTR MEDIA
400273Substitute CompensationInstructionSchool ReportedSchoolUnspecifiedSubstituteBase Salary/CompensationNO_LABELPreK-12 OperatingSALARIES OF PART TIME EMPLOYEENaNFEDERAL GDPG FUND - FYTeacher,Retrd Shrt Term SubRegularNaNNaNNaN0.00431TITLE II,DNaNPROFESSIONAL-INSTRUCTIONAL128.824985INSTRUCTIONAL STAFF TRAININGNaNINSTRUCTIONAL STAFF
400274Parent & Community RelationsNO_LABELSchool ReportedSchoolNO_LABELOtherNO_LABELNO_LABELPreK-12 OperatingNaNNaNNaNSchool LiaisonNaNNaNNaNNaN1.00000NaNNaNPARENT/TITLE I4902.290000MiscSchoolwide SchoolsNaN
400275Library & MediaInstructionSchool on Central BudgetsNon-SchoolUnspecifiedLibrarianBenefitsNO_LABELPreK-12 OperatingEMPLOYEE BENEFITSEDUCATIONAL RESOURCE SERVICESLEVY OVERRIDELibrary Technician IINaNNaNNaNED RESOURCE SERVICESNaNNON-PROJECTNaNOFFICE/ADMINISTRATIVE SUPPORT4020.290000MEDIA SUPPORT SERVICESNaNINSTRUCTIONAL STAFF
400276Substitute CompensationInstructionSchool ReportedSchoolPovertySubstituteSubstitute CompensationNon PreKPreK-12 OperatingSalaries And Wages For Substitute ProfessionalsNaN"Title Part A Improving Basic Programs"TEACHER SUBSTITUTE POOLNaNMultilingual Dist Prof DevelopmentInservice Substitute Teachers Grant FundedSchoolNaNInstructionInstruction And CurriculumCERTIFIED SUBSTITUTE46.530000Accelerated Education"Title Part A Improving Basic Programs"MISCELLANEOUS